Abstract. This work is concerned with finite range bounds on the variance of
individual eigenvalues of Wigner random matrices, in the bulk and at the edge of the
spectrum, as well as for some intermediate eigenvalues. Relying on the GUE example,
which needs to be investigated first, the main bounds are extended to families of
Hermitian Wigner matrices by means of the Tao and Vu Four Moment Theorem and
recent localization results by Erdős, Yau and Yin. The case of real Wigner matrices
is obtained from interlacing formulas. As an application, bounds on the expected
2-Wasserstein distance between the empirical spectral measure and the semicircle law
are derived. Similar results are available for random covariance matrices.




Two different models of random Hermitian matrices were introduced by Wishart
in the twenties and by Wigner in the fifties. Wishart was interested in modeling
tables of random data in multivariate analysis and worked on random covariance
matrices. In this paper, the results for covariance matrices are very close to those
for Wigner matrices; the paper therefore deals mainly with Wigner matrices. Definitions
and results regarding covariance matrices are available in the last section.

Random Wigner matrices were first introduced by Wigner to study eigenvalues
of infinite-dimensional operators in statistical physics (see [17]) and then propagated
to various fields of mathematics involved in the study of spectra of random matrices.
Under suitable symmetry assumptions, the asymptotic properties of the eigenvalues
of a random matrix were soon conjectured to be universal, in the sense that they do not
depend on the individual distribution of the matrix entries. This opened the way to
numerous developments on the asymptotics of various statistics of the eigenvalues
of random matrices, such as the global behavior of the spectrum, the
spacings between the eigenvalues in the bulk of the spectrum or the behavior of the
extreme eigenvalues. Two main models have been considered, invariant matrices
and Wigner matrices. In the invariant matrix models, the matrix law is unitary
invariant and the eigenvalue joint distribution can be written explicitly in terms of
a given potential. In the Wigner models, the matrix entries are independent (up to
symmetry conditions). The case where the entries are Gaussian is the only model
belonging to both types. In the latter case, the joint distribution of the eigenvalues
is thus explicitly known and the previous statistics have been completely studied.
One main focus of random matrix theory in the past decades was to prove that these
asymptotic behaviors were the same for non-Gaussian matrices (see for instance [1], 
[3] and [19]). 

However, in several fields such as computer science or statistics for example, 
asymptotic statements are often not enough, and more quantitative finite range 
results are required. Several recent developments have thus been concerned with 
non-asymptotic random matrix theory towards quantitative bounds (for instance on 
the probability for a certain event to occur) which are valid for all N, where N is the 
size of the given matrix. See for example [30] for an introduction to some problems 
considered in non-asymptotic random matrix theory. In this paper, we investigate 
in this respect variance bounds on the eigenvalues of families of Wigner random 
matrices. 

Wigner matrices are Hermitian or real symmetric matrices $M_N$ such that, if $M_N$
is complex, for $i < j$, the real and imaginary parts of $(M_N)_{ij}$ are independent and
identically distributed (iid) with mean 0 and variance $\frac{1}{2}$, and $(M_N)_{ii}$ are iid, with mean 0
and variance 1. In the real case, $(M_N)_{ij}$ are iid, with mean 0 and variance 1 and
$(M_N)_{ii}$ are iid, with mean 0 and variance 2. In both cases, set $W_N = \frac{1}{\sqrt{N}} M_N$. An
important example of Wigner matrices is the case where the entries are Gaussian. If
$M_N$ is complex, then it belongs to the so-called Gaussian Unitary Ensemble (GUE).
If it is real, it belongs to the Gaussian Orthogonal Ensemble (GOE). The matrix
$W_N$ has $N$ real eigenvalues $\lambda_1 \le \dots \le \lambda_N$. In the Gaussian case, the joint law of the
eigenvalues is known, allowing for complete descriptions of their limiting behavior
both in the global and local regimes (see for example [1], [3] and [19]).

Among universality results, at the global level, the classical Wigner's Theorem
states that the empirical distribution $L_N = \frac{1}{N}\sum_{j=1}^N \delta_{\lambda_j}$ on the eigenvalues of $W_N$
converges weakly almost surely to the semicircle law $d\rho_{sc}(x) = \frac{1}{2\pi}\sqrt{4 - x^2}\,\mathbf{1}_{[-2,2]}(x)\,dx$
(see for example [3] for a proof in the more general setting). This gives the global
asymptotic behavior of the spectrum. However, this theorem is not enough to deduce
information on individual eigenvalues. Define, for all $1 \le j \le N$, the
theoretical location $\gamma_j$ of the $j$-th eigenvalue by $\int_{-2}^{\gamma_j} d\rho_{sc}(x) = \frac{j}{N}$. Bai and Yin proved
in [2] with assumptions on higher moments that almost surely the smallest and
the largest eigenvalues converge to their theoretical locations, which means that
$\lambda_N \to 2$ and $\lambda_1 \to -2$ almost surely (see also [3] for a proof of this theorem). From
this almost sure convergence of the extreme eigenvalues and Wigner's Theorem, it
is possible to deduce information on individual eigenvalues in the bulk of the spectrum.
Indeed, according to the Glivenko-Cantelli Theorem (see for example [7]), the
normalized eigenvalue function $\frac{1}{N}\mathcal{N}_x$, where $\mathcal{N}_x$ is the number of eigenvalues which
are in $(-\infty, x]$, converges uniformly on $\mathbb{R}$ almost surely to the distribution function
$G$ of the semicircle law (with no more assumptions on the matrix entries). Then,
using that $\frac{1}{N}\mathcal{N}_{\lambda_j} = \frac{j}{N} = G(\gamma_j)$ together with crude bounds on the semicircle density
function and the fact that $\lambda_j$ is almost surely between $-2 - \varepsilon$ and $2 + \varepsilon$ shows that
almost surely $\lambda_j - \gamma_j \to 0$ uniformly for $\eta N \le j \le (1 - \eta)N$ (for any fixed $\eta > 0$).
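The theoretical locations $\gamma_j$ have no closed form in general, but they are easy to compute numerically. The following is a minimal sketch, assuming the closed-form expression of the semicircle distribution function $G$ and a plain bisection; the function names `semicircle_cdf` and `theoretical_location` are illustrative and do not come from the paper.

```python
import math


def semicircle_cdf(x: float) -> float:
    """Distribution function G of the semicircle law on [-2, 2]."""
    if x <= -2.0:
        return 0.0
    if x >= 2.0:
        return 1.0
    # Antiderivative of sqrt(4 - x^2) / (2 pi), normalised so that G(-2) = 0.
    return 0.5 + x * math.sqrt(4.0 - x * x) / (4.0 * math.pi) + math.asin(x / 2.0) / math.pi


def theoretical_location(j: int, n: int, tol: float = 1e-12) -> float:
    """Solve G(gamma_j) = j / n by bisection on [-2, 2]."""
    target = j / n
    lo, hi = -2.0, 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if semicircle_cdf(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

By symmetry of the semicircle law, `theoretical_location(j, n)` and `theoretical_location(n - j, n)` are opposite numbers, which gives a quick sanity check.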

At the fluctuation level, eigenvalues inside the bulk and at the edge of the spectrum
do not have the same behavior. Tracy and Widom showed in [28] that the
largest eigenvalue fluctuates around 2 according to the so-called Tracy-Widom law
$F_2$. Namely,

\[ N^{2/3}(\lambda_N - 2) \to F_2 \]

in distribution as $N$ goes to infinity. They proved this result for Gaussian matrices of
the GUE, later extended to families of non-Gaussian Wigner matrices by Soshnikov
(see [21]). Recent results by Tao and Vu (see [27]) and by Erdős, Yau and Yin
(see [8]) provide alternate proofs of this fact for larger families of Wigner matrices.
According to this asymptotic property, the variance of the largest eigenvalue $\lambda_N$ is
thus of the order of $N^{-4/3}$. In the bulk, Gustavsson proved in [2], again for the
GUE, that, for any fixed $\eta > 0$ and all $\eta N \le j \le (1 - \eta)N$,

\[ \frac{\lambda_j - \gamma_j}{\sqrt{\dfrac{2\log N}{(4 - \gamma_j^2)N^2}}} \to \mathcal{N}(0,1) \tag{1} \]

in distribution as $N$ goes to infinity. This result was extended by Tao and Vu in [26]
to large families of non-Gaussian Wigner matrices. The variance of an eigenvalue $\lambda_j$
in the bulk is thus of the order of $\frac{\log N}{N^2}$. Right-side intermediate eigenvalues consist
in the $\lambda_j$'s with $\frac{N}{2} \le j \le N$ such that $\frac{j}{N} \to 1$ but $N - j \to \infty$ as $N$ goes to
infinity (the left-side can be deduced by symmetry). Gustavsson proved a Central
Limit Theorem for these eigenvalues (see [H]) from which their variance is guessed
to be of the order of $\frac{\log(N-j)}{N^{4/3}(N-j)^{2/3}}$. This result was again extended to large classes of
Wigner matrices by Tao and Vu in [27].

The previous results are however asymptotic. As announced, the purpose of this
work is to provide quantitative bounds on the variance of the eigenvalues of the
correct order, in the bulk and at the edge of the spectrum, as well as for some
intermediate eigenvalues. In the statements below, $M_N$ is a complex (respectively real)
Wigner matrix satisfying condition (C0). This condition, which will be detailed in
Section 2.1, provides an exponential decay of the matrix entries. Assume furthermore
that the entries of $M_N$ have the same first four moments as the entries of a
GUE matrix (respectively GOE). Set $W_N = \frac{1}{\sqrt{N}} M_N$.



Theorem 1 (in the bulk). For all $0 < \eta \le \frac{1}{2}$, there exists a constant $C(\eta) > 0$ such
that, for all $N \ge 2$ and for all $\eta N \le j \le (1 - \eta)N$,

\[ \mathrm{Var}(\lambda_j) \le C(\eta)\,\frac{\log N}{N^2}. \tag{2} \]

Theorem 2 (at the edge). There exists a universal constant $C > 0$ such that, for
all $N \ge 1$,

\[ \mathrm{Var}(\lambda_1) \le C N^{-4/3} \quad\text{and}\quad \mathrm{Var}(\lambda_N) \le C N^{-4/3}. \tag{3} \]

Theorem 3 (between the bulk and the edge). For all $0 < \eta \le \frac{1}{2}$ and for all
$K > 20\sqrt{2}$, there exists a constant $C(\eta, K) > 0$ such that, for all $N \ge 2$, for all
$K\log N \le j \le \eta N$ and $(1 - \eta)N \le j \le N - K\log N$,

\[ \mathrm{Var}(\lambda_j) \le C(\eta, K)\,\frac{\log(\min(j, N-j))}{N^{4/3}(\min(j, N-j))^{2/3}}. \tag{4} \]
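To see how the three regimes fit together, the orders of magnitude appearing in the bulk, edge and intermediate bounds can be tabulated. The sketch below is only an illustration: constants are ignored, the helper name `variance_order` is hypothetical, and switching to the edge order $N^{-4/3}$ when $\min(j, N-j) < 2$ is an arbitrary convention chosen here, not a statement of the paper.

```python
import math


def variance_order(j: int, n: int) -> float:
    """Order of Var(lambda_j), up to constants, suggested by the three bounds.

    With m = min(j, n - j), the intermediate order is
    log(m) / (n^(4/3) * m^(2/3)); for m < 2 the edge order n^(-4/3) is used
    (an illustrative convention, since log(m) vanishes or is undefined there).
    """
    m = min(j, n - j)
    if m < 2:
        return n ** (-4.0 / 3.0)
    return math.log(m) / (n ** (4.0 / 3.0) * m ** (2.0 / 3.0))
```

For $j = N/2$ the intermediate expression equals $2^{2/3}\log(N/2)/N^2$, so it agrees with the bulk order $\log N / N^2$ up to a bounded factor.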



It should be mentioned that Theorem 1 does not seem to be known even for
Gaussian matrices. The first step is thus to prove it for the GUE. This will be
achieved via the analysis of the eigenvalue counting function, which, due to the
particular determinantal structure in this case, has the same distribution as a sum
of independent Bernoulli variables. Sharp standard deviation inequalities are thus
available in this case. These may then be transferred to the eigenvalues in the
bulk together with Gustavsson's bounds on the variance of the eigenvalue counting
function. As a result, we actually establish that

\[ \mathbb{E}\big[|\lambda_j - \gamma_j|^2\big] \le C(\eta)\,\frac{\log N}{N^2}, \]

leading thus to Theorem 1 in this case. Similarly, Theorem 3 does not seem to be
known for Gaussian matrices. The proof follows exactly the same scheme and we
establish that

\[ \mathbb{E}\big[|\lambda_j - \gamma_j|^2\big] \le C(\eta, K)\,\frac{\log(\min(j, N-j))}{N^{4/3}(\min(j, N-j))^{2/3}}. \]

On the other hand, Theorem 2 for the GUE and GOE has been known for some
time (see [15]). On the basis of these results for the GUE (and GOE), Theorems 1,
2 and 3 are then extended to families of Wigner matrices by a suitable combination
of Tao and Vu's Four Moment Theorem (see [26], [27]) and Erdős, Yau and Yin's
Localization Theorem (see [8]). The basic idea is that while the localization properties
almost yield the correct order, the Four Moment Theorem may be used to reach
the optimal bounds by comparison with the Gaussian models. Theorems 1, 2 and
3 are established first in the complex case. The real case is deduced by means of
interlacing formulas. Furthermore, analogous results are established for covariance
matrices, for which non-asymptotic quantitative results are needed, as they are useful
in several fields, such as compressed sensing (see [30]), wireless communication
and quantitative finance (see [3]).

The methods developed here do not seem powerful enough to strengthen the
variance bounds into exponential tail inequalities. In the recent contribution [21],
Tao and Vu proved such exponential tail inequalities by a further refinement of
the replacement method leading to the Four Moment Theorem. While much more
powerful than variance bounds, they do not seem to yield at this moment the correct
order of the variance bounds of Theorems 1, 2 and 3.

As a corollary of the latter results, a bound on the rate of convergence of the
empirical spectral measure $L_N$ can be achieved. This bound is expressed in terms of
the so-called 2-Wasserstein distance $W_2$ between $L_N$ and the semicircle law $\rho_{sc}$. For
$p \in [1, \infty)$, the $p$-Wasserstein distance $W_p(\mu, \nu)$ between two probability measures
$\mu$ and $\nu$ on $\mathbb{R}$ is defined by

\[ W_p(\mu, \nu) = \Big( \inf \int_{\mathbb{R}^2} |x - y|^p \, d\pi(x, y) \Big)^{1/p}, \]

where the infimum is taken over all probability measures $\pi$ on $\mathbb{R}^2$ such that its first
marginal is $\mu$ and its second marginal is $\nu$. Note for further purposes that the rate



of convergence of this empirical distribution has been investigated in various forms.
For example, the Kolmogorov distance between $L_N$ and $\rho_{sc}$ has been considered in
this respect in several papers (see for example [TT], [5] and [ID]). It is given by

\[ d_K(L_N, \rho_{sc}) = \sup_{x \in \mathbb{R}} \Big| \frac{1}{N}\mathcal{N}_x - G(x) \Big|, \]

where $\mathcal{N}_x$ is the eigenvalue counting function and $G$ is the distribution function of the
semicircle law. More and more precise bounds were established. For example, under
the hypothesis of an exponential decay of the matrix entries, Götze and Tikhomirov
recently showed that, with high probability,

\[ d_K(L_N, \rho_{sc}) \le \frac{(\log N)^c}{N} \]

for some universal constant $c > 0$ (see [ID]). The rate of convergence in terms of
$W_1$, also called the Kantorovich-Rubinstein distance, was studied by Guionnet and
Zeitouni in [T3] and recently by Meckes and Meckes in [ID]. In [ID], the authors
proved that $\mathbb{E}[W_1(L_N, \mathbb{E}[L_N])] \le C N^{-2/3}$. The following is concerned with the
distance between $L_N$ and $\rho_{sc}$ and strengthens the preceding conclusion on $W_1$.

Corollary 4. There exists a numerical constant $C > 0$ such that, for all $N \ge 2$,

\[ \mathbb{E}\big[W_2^2(L_N, \rho_{sc})\big] \le C\,\frac{\log N}{N^2}. \tag{5} \]

The proof of this corollary relies on the fact that $\mathbb{E}[W_2^2(L_N, \rho_{sc})]$ is bounded
above, up to a constant, by the sum of the expectations $\mathbb{E}[(\lambda_j - \gamma_j)^2]$. The previously
established bounds then easily yield the result.
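On the real line, the Wasserstein distance between two measures with the same number of equally weighted atoms is attained by pairing the atoms in increasing order (a standard fact for $p \ge 1$). The following sketch, with the hypothetical helper name `wasserstein_empirical`, computes $W_p$ in that special case; it may help to visualise how a distance between $L_N$ and a measure carried by the theoretical locations reduces to differences of the form $\lambda_j - \gamma_j$. It is not the paper's argument for the semicircle law itself.

```python
from typing import Sequence


def wasserstein_empirical(xs: Sequence[float], ys: Sequence[float], p: float = 2.0) -> float:
    """W_p between (1/n) sum delta_{x_i} and (1/n) sum delta_{y_i} on the real line.

    Uses the monotone (sorted) coupling, which is optimal in dimension one
    for p >= 1 when both measures have n atoms of mass 1/n.
    """
    if len(xs) != len(ys) or len(xs) == 0:
        raise ValueError("both samples must be non-empty and of equal size")
    if p < 1.0:
        raise ValueError("p must be at least 1")
    xs_sorted, ys_sorted = sorted(xs), sorted(ys)
    cost = sum(abs(x - y) ** p for x, y in zip(xs_sorted, ys_sorted)) / len(xs_sorted)
    return cost ** (1.0 / p)
```

For instance, shifting every atom by the same amount $a$ gives a distance equal to $|a|$ for every $p$.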

Turning to the organization of the paper, Section 1 describes Theorems 1, 2 and
3 in the GUE case. Section 2 starts by recalling the Four Moment Theorem
of Tao and Vu (see [26] and [27]) and the Localization Theorem of Erdős, Yau and
Yin (see [8]). On the basis of these results and the GUE case, the main Theorems
1, 2 and 3 are then established for families of Wigner matrices. Section 3 is devoted
to the corresponding statements for real matrices, while Section 5 describes the
analogous results for covariance matrices. Section 4 deals with Corollary 4 and the
rate of convergence of $L_N$ towards $\rho_{sc}$ in terms of 2-Wasserstein distance.

Throughout this paper, C(-) and c(-) will denote positive constants which depend 
on the specified quantities and may differ from one line to another. 

1 Deviation inequalities and variance bounds in 
the GUE case 

The results and proofs developed in this section for the GUE model heavily rely
on its determinantal structure which allows for a complete description of the joint
law of the eigenvalues (see [1]). In particular the eigenvalue counting function is
known to have the same distribution as a sum of independent Bernoulli variables 
(see [1], pQ), whose mean and variance were computed by Gustavsson (see [14"]). 
Deviation inequalities for individual eigenvalues can thus be established, leading to 
the announced bounds on the variance. 
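The eigenvalue counting function is also convenient to simulate. The sketch below is an illustration under two assumptions that are not taken from the paper: it samples the GUE spectrum through the tridiagonal $\beta = 2$ model of Dumitriu and Edelman (diagonal entries $N(0,1)$, squared off-diagonal entries Gamma distributed with shapes $N-1, \dots, 1$), and it counts eigenvalues below a level with the classical Sturm sequence recursion. The function names are hypothetical.

```python
import math
import random
from typing import List, Tuple


def sample_gue_tridiagonal(n: int, rng: random.Random) -> Tuple[List[float], List[float]]:
    """Diagonal and off-diagonal of a tridiagonal matrix with the GUE eigenvalue law.

    Assumed model (beta = 2): a_i ~ N(0, 1) and b_k^2 ~ Gamma(shape=k, scale=1)
    for k = n-1, ..., 1, so that the spectrum lies roughly in [-2 sqrt(n), 2 sqrt(n)].
    """
    diag = [rng.gauss(0.0, 1.0) for _ in range(n)]
    off = [math.sqrt(rng.gammavariate(k, 1.0)) for k in range(n - 1, 0, -1)]
    return diag, off


def count_eigenvalues_below(diag: List[float], off: List[float], t: float) -> int:
    """Number of eigenvalues < t of a symmetric tridiagonal matrix (Sturm count)."""
    count = 0
    d = 1.0
    for i, a in enumerate(diag):
        b2 = off[i - 1] ** 2 if i > 0 else 0.0
        d = (a - t) - b2 / d
        if d == 0.0:
            d = -1e-30  # tiny perturbation to avoid dividing by zero at the next step
        if d < 0.0:
            count += 1
    return count


def counting_function(diag: List[float], off: List[float], t: float) -> int:
    """N_t for W_N = M_N / sqrt(N): eigenvalues of W_N below t."""
    n = len(diag)
    return count_eigenvalues_below(diag, off, t * math.sqrt(n))
```

With this normalisation, `counting_function(diag, off, 0.0)` should be close to $N/2$, in line with the semicircle law.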



1.1 Eigenvalues in the bulk of the spectrum 

The aim of this section is to establish the following result. 

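Theorem 5. Let $M_N$ be a GUE matrix. Set $W_N = \frac{1}{\sqrt{N}} M_N$. For any $0 < \eta \le \frac{1}{2}$,
there exists a constant $C(\eta) > 0$ such that for all $N \ge 2$ and all $\eta N \le j \le (1 - \eta)N$,

\[ \mathbb{E}\big[|\lambda_j - \gamma_j|^2\big] \le C(\eta)\,\frac{\log N}{N^2}. \tag{6} \]

In particular,

\[ \mathrm{Var}(\lambda_j) \le C(\eta)\,\frac{\log N}{N^2}. \tag{7} \]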

As announced, the proof is based on the connection between the distribution
of eigenvalues and the eigenvalue counting function. For every $t \in \mathbb{R}$, let $\mathcal{N}_t =
\sum_{i=1}^N \mathbf{1}_{\lambda_i \le t}$ be the eigenvalue counting function. Due to the determinantal structure
of the GUE, it is known (see [1]) that $\mathcal{N}_t$ has the same distribution as a sum of
independent Bernoulli random variables. Bernstein's inequality for example (although
other, even sharper, inequalities may be used) may then be applied to get that for
every $u \ge 0$,

\[ \mathbb{P}\big(|\mathcal{N}_t - \mathbb{E}[\mathcal{N}_t]| \ge u\big) \le 2\exp\Big(-\frac{u^2}{2\sigma_t^2 + u}\Big), \tag{8} \]

where $\sigma_t^2$ is the variance of $\mathcal{N}_t$ (see for example [29]). Note that the upper-bound
is non-increasing in $u$ while non-decreasing in the variance. Set for simplicity $\rho_t =
\rho_{sc}((-\infty, t])$, $t \in \mathbb{R}$. It has been shown in [12] that for some numerical constant
$C_1 > 0$,

\[ \sup_{t \in \mathbb{R}} \big|\mathbb{E}[\mathcal{N}_t] - N\rho_t\big| \le C_1. \]

In particular thus, together with (8), for every $u \ge 0$,

\[ \mathbb{P}\big(|\mathcal{N}_t - N\rho_t| \ge u + C_1\big) \le 2\exp\Big(-\frac{u^2}{2\sigma_t^2 + u}\Big). \tag{9} \]
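The Bernstein-type bound for sums of independent Bernoulli variables can be compared with the exact tail on a small example. The sketch below is only an illustration: the success probabilities are arbitrary (they are not the ones arising from the GUE kernel), the exact law is obtained by a direct convolution, and the function names are hypothetical.

```python
import math
from typing import List, Sequence


def bernoulli_sum_pmf(probs: Sequence[float]) -> List[float]:
    """Exact law of a sum of independent Bernoulli(p_i) variables, by convolution."""
    pmf = [1.0]
    for p in probs:
        new = [0.0] * (len(pmf) + 1)
        for k, w in enumerate(pmf):
            new[k] += w * (1.0 - p)   # the new variable equals 0
            new[k + 1] += w * p       # the new variable equals 1
        pmf = new
    return pmf


def exact_two_sided_tail(probs: Sequence[float], u: float) -> float:
    """P(|S - E[S]| >= u) for S the sum of independent Bernoulli(p_i)."""
    mean = sum(probs)
    return sum(w for k, w in enumerate(bernoulli_sum_pmf(probs)) if abs(k - mean) >= u)


def bernstein_bound(probs: Sequence[float], u: float) -> float:
    """Right-hand side 2 exp(-u^2 / (2 sigma^2 + u)) of the Bernstein-type bound."""
    variance = sum(p * (1.0 - p) for p in probs)
    return 2.0 * math.exp(-u * u / (2.0 * variance + u))
```

As the text notes, the bound only depends on the variance of the sum, which is why a logarithmic bound on $\sigma_t^2$ is the key input in what follows.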

As a main conclusion of the work by Gustavsson [13], for every $\delta \in (0, 2)$, there
exists $c_\delta > 0$ such that

\[ \sup_{t \in I_\delta} \sigma_t^2 \le c_\delta \log N, \tag{10} \]

where $I_\delta = [-2 + \delta, 2 - \delta]$. On the basis of inequalities (9) and (10), it is then possible
to derive a deviation inequality for eigenvalues $\lambda_j$ in the bulk from their theoretical
locations $\gamma_j \in [-2, 2]$, $1 \le j \le N$, defined by $\rho_{\gamma_j} = \frac{j}{N}$.

Proposition 6 (Deviation inequality for $\lambda_j$). Let $\eta \in (0, \frac{1}{2}]$ and $\eta N \le j \le (1 - \eta)N$.
There exist $C > 0$, $c > 0$, $c' > 0$ and $\delta \in (0, 2)$ (all depending on $\eta$) such that, for
all $c \le u \le c'N$,

\[ \mathbb{P}\Big(|\lambda_j - \gamma_j| \ge \frac{u}{N}\Big) \le 4\exp\Big(-\frac{C^2u^2}{2c_\delta \log N + Cu}\Big). \tag{11} \]



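Proof. Let $\eta \in (0, \frac{1}{2}]$. To start with, evaluate, for $\frac{N}{2} \le j \le (1 - \eta)N$ and $u \ge 0$, the
probability $\mathbb{P}(|\lambda_j - \gamma_j| > \frac{u}{N})$. We have

\begin{align*}
\mathbb{P}\Big(\lambda_j > \gamma_j + \frac{u}{N}\Big)
&= \mathbb{P}\Big(\sum_{i=1}^N \mathbf{1}_{\lambda_i \le \gamma_j + \frac{u}{N}} < j\Big) \\
&= \mathbb{P}\big(\mathcal{N}_{\gamma_j + \frac{u}{N}} < j\big) \\
&= \mathbb{P}\big(N\rho_{\gamma_j + \frac{u}{N}} - \mathcal{N}_{\gamma_j + \frac{u}{N}} > N\rho_{\gamma_j + \frac{u}{N}} - j\big) \\
&= \mathbb{P}\big(N\rho_{\gamma_j + \frac{u}{N}} - \mathcal{N}_{\gamma_j + \frac{u}{N}} > N(\rho_{\gamma_j + \frac{u}{N}} - \rho_{\gamma_j})\big),
\end{align*}

where it has been used that $\rho_{\gamma_j} = \frac{j}{N}$. Then

\[ \mathbb{P}\Big(\lambda_j > \gamma_j + \frac{u}{N}\Big) \le \mathbb{P}\Big(\big|\mathcal{N}_{\gamma_j + \frac{u}{N}} - N\rho_{\gamma_j + \frac{u}{N}}\big| > N\big(\rho_{\gamma_j + \frac{u}{N}} - \rho_{\gamma_j}\big)\Big). \]

But

\begin{align*}
\rho_{\gamma_j + \frac{u}{N}} - \rho_{\gamma_j}
&= \int_{\gamma_j}^{\gamma_j + \frac{u}{N}} \frac{1}{2\pi}\sqrt{4 - x^2}\,dx \\
&\ge \frac{1}{\sqrt{2}\pi}\int_{\gamma_j}^{\gamma_j + \frac{u}{N}} \sqrt{2 - x}\,dx \\
&= \frac{\sqrt{2}}{3\pi}(2 - \gamma_j)^{3/2}\Big(1 - \Big(1 - \frac{u}{N(2 - \gamma_j)}\Big)^{3/2}\Big) \\
&\ge \frac{\sqrt{2}}{3\pi}(2 - \gamma_j)^{1/2}\,\frac{u}{N}, \tag{12}
\end{align*}

if $u \le (2 - \gamma_j)N$. By definition, $1 - \frac{j}{N} = \int_{\gamma_j}^{2} \frac{1}{2\pi}\sqrt{4 - x^2}\,dx$. Then

\[ \frac{1}{\sqrt{2}\pi}\int_{\gamma_j}^{2} \sqrt{2 - x}\,dx \le 1 - \frac{j}{N} \le \frac{1}{\pi}\int_{\gamma_j}^{2} \sqrt{2 - x}\,dx. \]

Computing the preceding integrals yields

\[ \Big(\frac{3\pi}{2}\Big)^{2/3}\Big(1 - \frac{j}{N}\Big)^{2/3} \le 2 - \gamma_j \le \Big(\frac{3\pi}{\sqrt{2}}\Big)^{2/3}\Big(1 - \frac{j}{N}\Big)^{2/3}. \]

Therefore, $u \le (2 - \gamma_j)N$ if $u \le c'N$ with $c' \le (\frac{3\pi}{2})^{2/3}\eta^{2/3}$. In this case, (12) yields

\[ \rho_{\gamma_j + \frac{u}{N}} - \rho_{\gamma_j} \ge 2C\,\frac{u}{N}, \]

where $C > 0$. Therefore

\begin{align*}
\mathbb{P}\Big(\lambda_j > \gamma_j + \frac{u}{N}\Big)
&\le \mathbb{P}\Big(\big|\mathcal{N}_{\gamma_j + \frac{u}{N}} - N\rho_{\gamma_j + \frac{u}{N}}\big| > 2Cu\Big) \\
&\le \mathbb{P}\Big(\big|\mathcal{N}_{\gamma_j + \frac{u}{N}} - N\rho_{\gamma_j + \frac{u}{N}}\big| > Cu + C_1\Big)
\end{align*}

when $u \ge \frac{C_1}{C} = c$. Then, applying (9) leads to

\[ \mathbb{P}\Big(\lambda_j > \gamma_j + \frac{u}{N}\Big) \le 2\exp\Big(-\frac{C^2u^2}{2\sigma^2_{\gamma_j + \frac{u}{N}} + Cu}\Big). \]

As $\gamma_j$ is in the bulk, there exist $\delta$ and $c' < (\frac{3\pi}{2})^{2/3}\eta^{2/3}$ such that $\gamma_j + \frac{u}{N} \in I_\delta$, for
all $\eta N \le j \le (1 - \eta)N$ and for all $c \le u \le c'N$, for all $N \ge 1$ (both $\delta$ and $c'$ depend
on $\eta$). Then

\[ \mathbb{P}\Big(\lambda_j > \gamma_j + \frac{u}{N}\Big) \le 2\exp\Big(-\frac{C^2u^2}{2c_\delta \log N + Cu}\Big). \]

Repeating the argument leads to the same bound on $\mathbb{P}(\lambda_j < \gamma_j - \frac{u}{N})$. Therefore,

\[ \mathbb{P}\Big(|\lambda_j - \gamma_j| \ge \frac{u}{N}\Big) \le 4\exp\Big(-\frac{C^2u^2}{2c_\delta \log N + Cu}\Big). \]

The proposition is thus established.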

□ 
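The key analytic step in the proof above is a lower bound on the semicircle mass of a short interval to the right of $\gamma_j$, namely $\rho_{\gamma + h} - \rho_{\gamma} \ge \frac{\sqrt{2}}{3\pi}(2 - \gamma)^{1/2} h$ for $\gamma \in [0, 2)$ and $0 \le h \le 2 - \gamma$. The sketch below checks this inequality numerically on a grid; it is a sanity check under the assumption that $G$ has the closed form used in `semicircle_cdf`, not a proof, and the function names are hypothetical.

```python
import math


def semicircle_cdf(x: float) -> float:
    """Distribution function of the semicircle law on [-2, 2]."""
    if x <= -2.0:
        return 0.0
    if x >= 2.0:
        return 1.0
    return 0.5 + x * math.sqrt(4.0 - x * x) / (4.0 * math.pi) + math.asin(x / 2.0) / math.pi


def increment_gap(gamma: float, h: float) -> float:
    """Mass of (gamma, gamma + h] minus the claimed lower bound."""
    mass = semicircle_cdf(gamma + h) - semicircle_cdf(gamma)
    lower = math.sqrt(2.0) / (3.0 * math.pi) * math.sqrt(2.0 - gamma) * h
    return mass - lower


def worst_gap(grid: int = 200) -> float:
    """Smallest gap over a grid of gamma in [0, 2) and h in [0, 2 - gamma]."""
    worst = float("inf")
    for a in range(grid):
        gamma = 2.0 * a / grid
        for b in range(grid + 1):
            h = (2.0 - gamma) * b / grid
            worst = min(worst, increment_gap(gamma, h))
    return worst
```

The gap vanishes at $h = 0$ and should otherwise be non-negative up to rounding errors.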

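Proof of Theorem 5. Note first that, for every $j$,

\[ \mathbb{E}[\lambda_j^4] \le \sum_{i=1}^N \mathbb{E}[\lambda_i^4] = \mathbb{E}\big[\mathrm{Tr}(W_N^4)\big]. \]

The mean of this trace can be easily computed and is equal to $2N + \frac{1}{N}$. Consequently,
for all $N \ge 1$,

\[ \mathbb{E}[\lambda_j^4] \le 3N. \tag{13} \]

Choose next $M = M(\eta) > 0$ large enough such that $\frac{C^2M^2}{2c_\delta + CM} > 5$. Setting $Z =
N|\lambda_j - \gamma_j|$,

\begin{align*}
\mathbb{E}[Z^2] &= \int_0^\infty \mathbb{P}(Z \ge v)\,2v\,dv \\
&= \int_0^c \mathbb{P}(Z \ge v)\,2v\,dv + \int_c^{M\log N} \mathbb{P}(Z \ge v)\,2v\,dv + \int_{M\log N}^\infty \mathbb{P}(Z \ge v)\,2v\,dv \\
&\le c^2 + I_1 + I_2.
\end{align*}

The two latter integrals are handled in different ways. The first one $I_1$ is bounded
using (11) while $I_2$ is controlled using the Cauchy-Schwarz inequality and (13).
Starting thus with $I_2$,

\begin{align*}
I_2 &= \int_{M\log N}^{+\infty} \mathbb{P}(Z \ge v)\,2v\,dv \\
&\le \mathbb{E}\big[Z^2 \mathbf{1}_{Z \ge M\log N}\big] \\
&\le \sqrt{\mathbb{E}[Z^4]}\,\sqrt{\mathbb{P}(Z \ge M\log N)} \\
&\le A\exp\Big(\frac{1}{2}\Big(5 - \frac{C^2M^2}{2c_\delta + CM}\Big)\log N\Big),
\end{align*}

where $A > 0$ is a numerical constant. As $\exp\big(\frac{1}{2}\big(5 - \frac{C^2M^2}{2c_\delta + CM}\big)\log N\big) \to 0$, there
exists a constant $C(\eta) > 0$ such that

\[ I_2 \le C(\eta). \]

Turning to $I_1$, recall that Proposition 6 gives, for $c \le v \le c'N$,

\[ \mathbb{P}(Z \ge v) = \mathbb{P}\Big(|\lambda_j - \gamma_j| \ge \frac{v}{N}\Big) \le 4\exp\Big(-\frac{C^2v^2}{2c_\delta \log N + Cv}\Big). \]

Hence in the range $v \le M\log N$,

\[ \mathbb{P}(Z \ge v) \le 4\exp\Big(-\frac{B}{\log N}\,v^2\Big), \]

where $B = B(\eta) = \frac{C^2}{2c_\delta + CM}$. There exists thus a constant $C(\eta) > 0$ such that

\[ I_1 \le C(\eta)\log N. \]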
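The bound on $I_1$ can be made explicit. The following computation is a sketch, assuming a sub-Gaussian tail of the form $4\exp(-Bv^2/\log N)$ on the whole range of integration (with $B > 0$ depending only on $\eta$) and extending the integral to the half-line; the author may proceed differently.

```latex
\[
I_1 = \int_c^{M\log N} \mathbb{P}(Z \ge v)\,2v\,dv
\le \int_0^{\infty} 4\exp\Big(-\frac{B}{\log N}\,v^2\Big)\,2v\,dv
= 4\Big[-\frac{\log N}{B}\exp\Big(-\frac{B}{\log N}\,v^2\Big)\Big]_{0}^{\infty}
= \frac{4}{B}\,\log N .
\]
```

This is where the logarithmic factor in the variance bound comes from: the Gaussian-type tail has variance proxy of order $\log N$.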
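Summarizing the previous steps, $\mathbb{E}[Z^2] \le C(\eta)\log N$. Therefore

\[ \mathbb{E}\big[|\lambda_j - \gamma_j|^2\big] \le C(\eta)\,\frac{\log N}{N^2}, \]

which is the claim. The proof of Theorem 5 is complete. □

It may be shown similarly that, under the assumptions of Theorem 5,

\[ \mathbb{E}\big[|\lambda_j - \gamma_j|^p\big] \le C(p, \eta)\,\frac{(\log N)^{p/2}}{N^p}. \]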



1.2 Eigenvalues at the edge of the spectrum 



In [15], Ledoux and Rider gave unified proofs of precise small deviation inequalities
for the extreme eigenvalues of $\beta$-ensembles. The results hold in particular for GUE
matrices ($\beta = 2$) and for GOE matrices ($\beta = 1$). The following theorem summarizes
some of the relevant inequalities for the GUE.

Theorem 7. There exists a universal constant $C > 0$ such that the following holds.
Let $M_N$ be a GUE matrix. Set $W_N = \frac{1}{\sqrt{N}} M_N$ and denote by $\lambda_N$ the maximal
eigenvalue of $W_N$. Then, for all $N \in \mathbb{N}$ and all $0 < \varepsilon \le 1$,

\[ \mathbb{P}\big(\lambda_N \ge 2(1 + \varepsilon)\big) \le C\exp\Big(-\frac{2N}{C}\,\varepsilon^{3/2}\Big), \tag{14} \]

and

\[ \mathbb{P}\big(\lambda_N \le 2(1 - \varepsilon)\big) \le C^2\exp\Big(-\frac{2N^2}{C}\,\varepsilon^{3}\Big). \tag{15} \]

There exists also a right-tail large deviation inequality for $\varepsilon = O(1)$ of the form

\[ \mathbb{P}\big(\lambda_N \ge 2(1 + \varepsilon)\big) \le C\exp\Big(-\frac{2N}{C}\,\varepsilon^{2}\Big), \tag{16} \]

where $C > 0$ is a universal constant. Similar inequalities hold for the smallest
eigenvalue $\lambda_1$. As stated in [15], bounds on the variance follow directly from these
deviation inequalities.

Corollary 8. Let $M_N$ be a GUE matrix. Set $W_N = \frac{1}{\sqrt{N}} M_N$. Then there exists a
universal constant $C > 0$ such that for all $N \ge 1$,

\[ \mathrm{Var}(\lambda_N) \le \mathbb{E}\big[(\lambda_N - 2)^2\big] \le C N^{-4/3}. \]
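To see why tails of this shape produce the order $N^{-4/3}$, one may integrate them. The following is a heuristic sketch: $a$ and $b$ stand for generic positive constants, the right and left tails of $\lambda_N - 2$ are assumed to be of the form $\exp(-aNs^{3/2})$ and $\exp(-bN^2s^3)$ for all $s \ge 0$, and the restriction on the range of $\varepsilon$ (handled in the source by the large deviation inequality) is ignored.

```latex
\begin{align*}
\mathbb{E}\big[(\lambda_N - 2)^2\big]
&= \int_0^\infty 2s\,\mathbb{P}\big(|\lambda_N - 2| \ge s\big)\,ds, \\
\int_0^\infty 2s\,e^{-aNs^{3/2}}\,ds
&= \frac{4}{3}\,\Gamma\Big(\frac{4}{3}\Big)\,(aN)^{-4/3}
\qquad (w = aNs^{3/2}), \\
\int_0^\infty 2s\,e^{-bN^2s^{3}}\,ds
&= \frac{2}{3}\,\Gamma\Big(\frac{2}{3}\Big)\,(bN^2)^{-2/3}
= \frac{2}{3}\,\Gamma\Big(\frac{2}{3}\Big)\,b^{-2/3}N^{-4/3}
\qquad (w = bN^2s^{3}).
\end{align*}
```

Both tails therefore contribute at the same scale $N^{-4/3}$, which matches the Tracy-Widom fluctuation scale $N^{-2/3}$ of the largest eigenvalue.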

Similar results are probably true for the k th smallest or largest eigenvalue (for 
k G N fixed). 
1.3 Eigenvalues between the bulk and the edge of the spectrum

The aim of this section is to establish a bound on the variance for some intermediate
eigenvalues. The proof is very similar to what was done for eigenvalues in the bulk.
It relies on the fact that the eigenvalue counting function has the same distribution
as a sum of independent Bernoulli variables due to the determinantal properties
of the eigenvalues. A deviation inequality for individual eigenvalues can then be
derived and the bound on the variance follows directly. Parts of the proof which
are identical to the proof for eigenvalues in the bulk will be omitted. In what follows
we only consider the right-side of the spectrum. Results and proofs for the left-side
can be deduced by replacing $N - j$ by $j$. The precise statement is the following.

Theorem 9. Let $M_N$ be a GUE matrix. Set $W_N = \frac{1}{\sqrt{N}} M_N$. For all $K \ge 20\sqrt{2}$ and
for all $\eta \in (0, \frac{1}{2}]$, there exists a constant $C(\eta, K) > 0$ such that for all $N \ge 2$ and
all $(1 - \eta)N \le j \le N - K\log N$,

\[ \mathbb{E}\big[|\lambda_j - \gamma_j|^2\big] \le C(\eta, K)\,\frac{\log(N - j)}{N^{4/3}(N - j)^{2/3}}. \tag{17} \]

In particular,

\[ \mathrm{Var}(\lambda_j) \le C(\eta, K)\,\frac{\log(N - j)}{N^{4/3}(N - j)^{2/3}}. \tag{18} \]

The preceding theorem does not concern all intermediate eigenvalues since $N - j$
has to be at least of the order of $20\sqrt{2}\log N$ for the method used here to yield the
correct order on the variance. This restriction seems however to be technical since
Gustavsson [13] proved that a Central Limit Theorem holds for all eigenvalues $\lambda_j$
such that $N - j \to \infty$ but $\frac{N - j}{N} \to 0$. From this CLT, the variance of such an
individual eigenvalue $\lambda_j$ is guessed to be similarly of the order of $\frac{\log(N - j)}{N^{4/3}(N - j)^{2/3}}$ for this
range.

As for eigenvalues in the bulk, the proof relies on the deviation inequality for
the eigenvalue counting function (9). From this and a bound on the variance of
this counting function (10), it was then possible to derive a deviation inequality for
eigenvalues in the bulk. The work of Gustavsson [13] suggests that for all $0 < \tilde{\eta} < 4$
and for all $\tilde{K} > 0$, there exists a constant $c_{\tilde{\eta}, \tilde{K}} > 0$ such that the following holds.
For every sequence $(t_N)_{N \in \mathbb{N}}$ such that, for all $N$, $0 < 2 - t_N \le \tilde{\eta}$ and $N(2 - t_N)^{3/2} \ge
\tilde{K}\log N$,

\[ \sigma_{t_N}^2 \le c_{\tilde{\eta}, \tilde{K}}\,\log\big(N(2 - t_N)^{3/2}\big). \tag{19} \]

Similarly to the bulk case, the following proposition can be established.

Proposition 10 (Deviation inequality for intermediate $\lambda_j$). There exist universal
positive constants $C$ and $c$ such that the following holds. Let $K > 20\sqrt{2}$ and $\eta \in
(0, \frac{1}{2}]$. Set $(1 - \eta)N \le j \le N - K\log N$. There exist $C' > 0$ and $c' > 0$ depending
on $K$ and $\eta$ such that for all $c \le u \le c'(N - j)$,

\[ \mathbb{P}\Big(|\lambda_j - \gamma_j| > \frac{u}{N^{2/3}(N - j)^{1/3}}\Big) \le 4\exp\Big(-\frac{C^2u^2}{C'\log(N - j) + Cu}\Big). \tag{20} \]



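Proof. Set $C = 2^{-5/6}(3\pi)^{-2/3}$. Let $K > 20\sqrt{2}$ and $\eta \in (0, \frac{1}{2}]$. Take $\alpha \in (\frac{20\sqrt{2}}{K}, 1)$ and
set $c' = \alpha(\frac{3\pi}{2})^{2/3}$. For $(1 - \eta)N \le j \le N - K\log N$ and $u \ge 0$, set $u_{N,j} = \frac{u}{N^{2/3}(N - j)^{1/3}}$.
As in the proof of Proposition 6, evaluating the probability $\mathbb{P}(|\lambda_j - \gamma_j| > u_{N,j})$ yields

\[ \mathbb{P}\big(\lambda_j > \gamma_j + u_{N,j}\big) \le \mathbb{P}\Big(\big|\mathcal{N}_{\gamma_j + u_{N,j}} - N\rho_{\gamma_j + u_{N,j}}\big| > N\big(\rho_{\gamma_j + u_{N,j}} - \rho_{\gamma_j}\big)\Big). \]

But, if $u \le N^{2/3}(N - j)^{1/3}(2 - \gamma_j)$,

\[ \rho_{\gamma_j + u_{N,j}} - \rho_{\gamma_j} \ge \frac{\sqrt{2}}{3\pi}(2 - \gamma_j)^{1/2}\,u_{N,j}. \]

Similarly to the proof of Proposition 6, this condition holds if $u \le c'(N - j)$. In this
case,

\[ \mathbb{P}\big(\lambda_j > \gamma_j + u_{N,j}\big) \le \mathbb{P}\Big(\big|\mathcal{N}_{\gamma_j + u_{N,j}} - N\rho_{\gamma_j + u_{N,j}}\big| > Cu + C_1\Big), \]

when $u \ge c = \frac{C_1}{C}$. Then, applying (9) leads to

\[ \mathbb{P}\big(\lambda_j > \gamma_j + u_{N,j}\big) \le 2\exp\Big(-\frac{C^2u^2}{2\sigma^2_{\gamma_j + u_{N,j}} + Cu}\Big) \]

for all $c \le u \le c'(N - j)$. Gustavsson's result (19) gives a bound on $\sigma^2_{\gamma_j + u_{N,j}}$. Set
$t_N = \gamma_j + u_{N,j}$. As $j \ge (1 - \eta)N$ and $u \ge 0$, $0 < 2 - t_N \le 2 - \gamma_j \le (\frac{3\pi}{\sqrt{2}}\eta)^{2/3} = \tilde{\eta}$ for
all $N$. Moreover,

\begin{align*}
N(2 - t_N)^{3/2} &= N(2 - \gamma_j - u_{N,j})^{3/2} \\
&\ge N\Big(\Big(\frac{3\pi}{2}\Big)^{2/3}\Big(\frac{N - j}{N}\Big)^{2/3} - \frac{c'(N - j)^{2/3}}{N^{2/3}}\Big)^{3/2} \\
&= \frac{3\pi}{2}(1 - \alpha)^{3/2}(N - j) \\
&\ge \frac{3\pi}{2}(1 - \alpha)^{3/2}K\log N \\
&= \tilde{K}\log N,
\end{align*}

where $\tilde{K} = \frac{3\pi}{2}(1 - \alpha)^{3/2}K > 0$. From (19), for all $c \le u \le c'(N - j)$,

\[ \mathrm{Var}\big(\mathcal{N}_{\gamma_j + u_{N,j}}\big) \le c_{\tilde{\eta}, \tilde{K}}\,\log\big(N(2 - t_N)^{3/2}\big). \]

But

\begin{align*}
N(2 - t_N)^{3/2} &= N(2 - \gamma_j - u_{N,j})^{3/2} \\
&\le N(2 - \gamma_j)^{3/2} \\
&\le \frac{3\pi}{\sqrt{2}}(N - j).
\end{align*}

Hence $\log\big(N(2 - t_N)^{3/2}\big) \le \log(N - j) + \log(\frac{3\pi}{\sqrt{2}})$. As $K > 20\sqrt{2}$ and $N \ge 2$,
$N - j \ge K\log N > \frac{3\pi}{\sqrt{2}}$ and $\mathrm{Var}(\mathcal{N}_{\gamma_j + u_{N,j}}) \le 2c_{\tilde{\eta}, \tilde{K}}\log(N - j)$. Therefore

\[ \mathbb{P}\big(\lambda_j > \gamma_j + u_{N,j}\big) \le 2\exp\Big(-\frac{C^2u^2}{4c_{\tilde{\eta}, \tilde{K}}\log(N - j) + Cu}\Big). \]

The proof is concluded similarly to Proposition 6. □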
On the basis of Proposition 10 we may then conclude the proof of Theorem 9.

Proof of Theorem 9. Denote by $C'$ the constant multiplying $\log(N - j)$ in (20). Setting $Z = N^{2/3}(N - j)^{1/3}|\lambda_j - \gamma_j|$,

\begin{align*}
\mathbb{E}[Z^2] &= \int_0^\infty \mathbb{P}(Z \ge v)\,2v\,dv \\
&= \int_0^c \mathbb{P}(Z \ge v)\,2v\,dv + \int_c^{\frac{C'}{C}\log(N - j)} \mathbb{P}(Z \ge v)\,2v\,dv \\
&\quad + \int_{\frac{C'}{C}\log(N - j)}^{c'(N - j)} \mathbb{P}(Z \ge v)\,2v\,dv + \int_{c'(N - j)}^{\infty} \mathbb{P}(Z \ge v)\,2v\,dv \\
&\le c^2 + J_1 + J_2 + J_3.
\end{align*}

Repeating the computations carried out with $I_2$ in the proof of Theorem 5 yields

\[ J_3 \le 2AN^{11/6}(N - j)^{2/3}\exp\Big(-\frac{1}{2}\,\frac{C^2c'^2(N - j)^2}{C'\log(N - j) + Cc'(N - j)}\Big), \]

where $A > 0$ is a numerical constant. For $N$ large enough (depending on $\eta$ and $K$),
$C'\log(N - j) \le Cc'(N - j)$ and

\[ J_3 \le 2AN^{5/2}\exp\Big(-\frac{Cc'}{4}(N - j)\Big). \]

Then, as $N - j \ge K\log N$,

\[ J_3 \le 2AN^{5/2}\exp\Big(-\frac{Cc'K}{4}\log N\Big). \]

But $\frac{KCc'}{4} = \frac{K\alpha}{8\sqrt{2}} > \frac{5}{2}$. The right-hand side goes thus to 0 when $N$ goes to infinity.
As a consequence, there exists a constant $C(\eta, K) > 0$ such that

\[ J_3 \le C(\eta, K). \]

The integral $J_1$ is handled as $I_1$, using that, in the range $v \le \frac{C'}{C}\log(N - j)$,

\[ \mathbb{P}(Z \ge v) \le 4\exp\Big(-\frac{B}{\log(N - j)}\,v^2\Big), \]

where $B = B(\eta, K) = \frac{C^2}{2C'}$ (this is due to Proposition 10). Hence, there exists a
constant $C(\eta, K)$ such that

\[ J_1 \le C(\eta, K)\log(N - j). \]

Finally, $J_2$ is handled similarly. From Proposition 10, in the range $\frac{C'}{C}\log(N - j) \le
v \le c'(N - j)$,

\[ \mathbb{P}(Z \ge v) \le 4\exp\Big(-\frac{C}{2}\,v\Big). \]

Thus

\[ J_2 \le 4\int_{\frac{C'}{C}\log(N - j)}^{c'(N - j)} \exp\Big(-\frac{C}{2}\,v\Big)\,2v\,dv \le 4\int_0^{\infty} \exp\Big(-\frac{C}{2}\,v\Big)\,2v\,dv. \]

Then $J_2$ is bounded by a constant, which is independent of $\eta$ and $K$. There exists
thus a constant $C > 0$ such that

\[ J_2 \le C. \]

Summarizing the previous steps, $\mathbb{E}[Z^2] \le C(\eta, K)\log(N - j)$. Therefore

\[ \mathbb{E}\big[|\lambda_j - \gamma_j|^2\big] \le C(\eta, K)\,\frac{\log(N - j)}{N^{4/3}(N - j)^{2/3}}, \]

which is the claim. □

2 Variance bounds for Wigner Hermitian matrices

As announced, the goal of this section is to prove Theorems 1, 2 and 3 for Wigner
Hermitian matrices. The eigenvalues of a Wigner Hermitian matrix do not form
a determinantal process. Therefore it does not seem easy to provide deviation
inequalities for the counting function and for individual eigenvalues. However the
sharp non-asymptotic bounds established in the Gaussian case can still be reached
by a comparison procedure.

2.1 Localization of the eigenvalues and the Four Moment
Theorem

Two main recent theorems will be used in order to carry out this comparison procedure.
First, Erdős, Yau and Yin proved in [8] a Localization Theorem which gives
a high probability non-asymptotic bound on the distance between an eigenvalue $\lambda_j$
and its theoretical value $\gamma_j$. Secondly, Tao and Vu's Four Moment Theorem (see
[26] and [27]) provides a very useful non-asymptotic bound on the error made by
approximating a statistic of the eigenvalues of a Wigner matrix by the same statistic
but with the eigenvalues of a GUE matrix.

Let $M_N$ be a Wigner Hermitian matrix. Say that $M_N$ satisfies condition (C0)
if the real part $\xi$ and the imaginary part $\tilde{\xi}$ of $(M_N)_{ij}$ are independent and have an
exponential decay: there are two positive constants $B_1$ and $B_2$ such that

\[ \mathbb{P}\big(|\xi| \ge t^{B_1}\big) \le e^{-t} \quad\text{and}\quad \mathbb{P}\big(|\tilde{\xi}| \ge t^{B_1}\big) \le e^{-t} \]

for all $t \ge B_2$.
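As a quick illustration of the exponential decay condition, centered Gaussian entries satisfy it with $B_1 = B_2 = 1$, since $\mathbb{P}(|\xi| \ge t) \le e^{-t^2} \le e^{-t}$ for $t \ge 1$ when $\xi$ has variance $\frac{1}{2}$. The sketch below checks a decay condition of this form numerically on a finite grid; this is a sanity check rather than a proof, and the helper names `gaussian_tail` and `satisfies_decay` are hypothetical.

```python
import math
from typing import Callable


def gaussian_tail(s: float, variance: float = 0.5) -> float:
    """P(|xi| >= s) for a centered Gaussian variable with the given variance."""
    return math.erfc(s / math.sqrt(2.0 * variance))


def satisfies_decay(tail: Callable[[float], float], b1: float, b2: float,
                    t_max: float = 50.0, steps: int = 2000) -> bool:
    """Check tail(t**b1) <= exp(-t) on a grid of t in [b2, t_max]."""
    for i in range(steps + 1):
        t = b2 + (t_max - b2) * i / steps
        if tail(t ** b1) > math.exp(-t):
            return False
    return True
```

A polynomially decaying tail such as $s \mapsto \min(1, s^{-2})$ fails the same check with $B_1 = 1$, which shows that the condition excludes heavy-tailed entries unless $B_1$ is adapted.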

Theorem 11 (Localization [8]). Let $M_N$ be a random Hermitian matrix whose
entries satisfy condition (C0). There are positive universal constants $c$ and $C$ such
that, for any $1 \le j \le N$,

\[ \mathbb{P}\Big(|\lambda_j - \gamma_j| \ge (\log N)^{c\log\log N}N^{-2/3}\min(j, N + 1 - j)^{-1/3}\Big) \le Ce^{-(\log N)^{c\log\log N}}. \tag{21} \]

This is a strong localization result and it almost yields the correct order on the
bound on the variance. Indeed, by means of the Cauchy-Schwarz inequality and
(13), it can be shown that, for $\eta N \le j \le (1 - \eta)N$,

\[ \mathrm{Var}(\lambda_j) \le \mathbb{E}\big[(\lambda_j - \gamma_j)^2\big] \le \frac{C(\eta)}{N^2}(\log N)^{c\log\log N}. \]

In Tao and Vu's recent paper [21] on deviation inequalities, the authors proved a
more precise localization result. They indeed established a bound similar to (21)
but with $(\log N)^A$ instead of $(\log N)^{c\log\log N}$ (where $A > 0$ is fixed). However this
more precise bound is of no help below, as the final bounds on the variances remain
unchanged.

We turn now to Tao and Vu's Four Moment Theorem, in order to compare $W_N$
with a GUE matrix $W'_N$. Say that two complex random variables $\xi$ and $\xi'$ match to
order $k$ if

\[ \mathbb{E}\big[\Re(\xi)^m\,\Im(\xi)^l\big] = \mathbb{E}\big[\Re(\xi')^m\,\Im(\xi')^l\big] \]

for all $m, l \ge 0$ such that $m + l \le k$.
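The matching condition is easy to test for discrete laws. The sketch below is an illustration with hypothetical helper names: it computes the mixed moments $\mathbb{E}[\Re(\xi)^m \Im(\xi)^l]$ of a finitely supported law and compares two laws up to a given order. The two example laws are chosen here for illustration only: a symmetric sign, and a three-point law on $\{-\sqrt{3}, 0, \sqrt{3}\}$ whose first four moments $0, 1, 0, 3$ are those of a real standard Gaussian variable.

```python
import math
from typing import Dict


def mixed_moment(dist: Dict[complex, float], m: int, l: int) -> float:
    """E[Re(xi)^m Im(xi)^l] for a discrete law given as {value: probability}."""
    return sum(p * (complex(z).real ** m) * (complex(z).imag ** l) for z, p in dist.items())


def match_to_order(dist_a: Dict[complex, float], dist_b: Dict[complex, float],
                   k: int, tol: float = 1e-12) -> bool:
    """True if all mixed moments with m + l <= k agree up to the tolerance."""
    for m in range(k + 1):
        for l in range(k + 1 - m):
            if abs(mixed_moment(dist_a, m, l) - mixed_moment(dist_b, m, l)) > tol:
                return False
    return True


# Symmetric sign: moments 0, 1, 0, 1.
rademacher = {1.0: 0.5, -1.0: 0.5}
# Three-point law: moments 0, 1, 0, 3.
three_point = {math.sqrt(3.0): 1.0 / 6.0, -math.sqrt(3.0): 1.0 / 6.0, 0.0: 2.0 / 3.0}
```

The two laws match to order 3 but not to order 4, since their fourth moments are 1 and 3 respectively.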

Theorem 12 (Four Moment Theorem [26], [27]). There exists a small positive constant
$c_0$ such that the following holds. Let $M_N = (\xi_{ij})_{1 \le i,j \le N}$ and $M'_N = (\xi'_{ij})_{1 \le i,j \le N}$
be two random Wigner Hermitian matrices satisfying condition (C0). Assume that,
for $1 \le i < j \le N$, $\xi_{ij}$ and $\xi'_{ij}$ match to order 4 and that, for $1 \le i \le N$, $\xi_{ii}$ and
$\xi'_{ii}$ match to order 2. Set $A_N = \sqrt{N}M_N$ and $A'_N = \sqrt{N}M'_N$. Let $G : \mathbb{R} \to \mathbb{R}$ be a
smooth function such that:

\[ \forall\, 0 \le k \le 5,\ \forall x \in \mathbb{R},\quad |G^{(k)}(x)| \le N^{c_0}. \tag{22} \]

Then, for all $1 \le i \le N$ and for $N$ large enough (depending on constants $B_1$ and
$B_2$ in condition (C0)),

\[ \big|\mathbb{E}[G(\lambda_i(A_N))] - \mathbb{E}[G(\lambda_i(A'_N))]\big| \le N^{-c_0}. \tag{23} \]

Actually Tao and Vu proved this theorem in a more general form, involving
a finite number of eigenvalues. In this work, it will only be used with one given
eigenvalue. See [26] and [21] for more details.

It should be mentioned that Tao and Vu extended in [26] Gustavsson's result
(see equation (1)) via this theorem. By means of a smooth bump function $G$, they
compared the probability for $\lambda_j$ to be in a given interval for a non-Gaussian matrix
with almost the same probability but for a GUE matrix. Applying this technique
to $\mathbb{P}(|\lambda_j - \gamma_j| > \frac{u}{N})$ in order to extend directly the deviation inequality leads to
the following in the general Wigner case: for all $\eta N \le j \le (1 - \eta)N$ and for all
$C' \le u \le c'N$,

\[ \mathbb{P}\Big(|\lambda_j - \gamma_j| > \frac{u}{N}\Big) \le C\exp\Big(-\frac{u^2}{c\log N + u}\Big) + O(N^{-c_0}), \]

where $C$, $C'$, $c$ and $c'$ are positive constants depending only on $\eta$. The bound is not
exponential anymore and is not enough to conclude towards sharp bounds on the
variance or higher moments.
2.2 Comparison with Gaussian matrices

Let $M_N$ be a Hermitian Wigner matrix and $M'_N$ be a GUE matrix such that they
satisfy the hypotheses of Theorem 12. As the function $G : x \in \mathbb{R} \mapsto x^2$ does not
satisfy (22), Theorem 12 will be applied to a truncation of $G$. Theorem 11 will
provide a small area around the theoretical location $\gamma_j$ where the eigenvalue $\lambda_j$ is
very likely to be, so that the error due to the truncation will be well controlled.
Note that this procedure is valid for eigenvalues in the bulk and at the edge of the
spectrum, as well as for intermediate eigenvalues.

Let 1 < j < N. Set R% ] = (\ogN) clo ^°^ N N 1 / 3 mm(j, N + 1 - j)" 1/3 and 
Q e -(iogN) c °s °s _ Then Theorem [Til leads to: 



EjV 



P 



(l^-»i>f) 



< 



£n- 



(24) 



Let $\psi$ be a smooth function with support $[-2,2]$ and values in $[0,1]$ such that
$\psi(x) = \frac{1}{10}x^2$ for all $x \in [-1,1]$. Set $G_j : x \in \mathbb{R} \mapsto \psi\big(\frac{x - N\gamma_j}{R_N^{(j)}}\big)$. We want to apply
Tao and Vu's Four Moment Theorem 12 to $G_j$. As $\psi$ is smooth and has compact
support, its first five derivatives are bounded by $M > 0$. Then, for all $0 \le k \le 5$,
for all $x \in \mathbb{R}$,

\[ |G_j^{(k)}(x)| \le \frac{M}{(R_N^{(j)})^k} \le N^{c_0}, \]

where the last inequality holds for $N$ large enough (depending only on $M$ and $c_0$).
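One concrete way to realize such a truncation is sketched below in Python. The text only requires that some smooth function with these properties exists; building it as $x^2/10$ times a smooth cutoff is an assumption made here for illustration, and the names `smooth_step`, `cutoff`, `psi` and `G_j` are hypothetical.

```python
import math


def smooth_step(t: float) -> float:
    """C-infinity step: equals 0 for t <= 0 and 1 for t >= 1."""
    def f(s: float) -> float:
        # exp(-1/s) extended by 0 is smooth on the whole real line.
        return math.exp(-1.0 / s) if s > 0 else 0.0
    # The denominator never vanishes: t and 1 - t cannot both be <= 0.
    return f(t) / (f(t) + f(1.0 - t))


def cutoff(x: float) -> float:
    """Smooth cutoff equal to 1 on [-1, 1] and 0 outside [-2, 2]."""
    return smooth_step(2.0 - abs(x))


def psi(x: float) -> float:
    """Illustrative choice: x^2/10 on [-1, 1], support in [-2, 2], values in [0, 1]."""
    return (x * x / 10.0) * cutoff(x)


def G_j(x: float, n_gamma_j: float, r: float) -> float:
    """Rescaled test function x -> psi((x - N*gamma_j) / R); r plays the role of R_N^(j)."""
    return psi((x - n_gamma_j) / r)
```

Since $G_j$ is $\psi$ composed with an affine map of slope $1/R_N^{(j)}$, each derivative of order $k$ picks up a factor $(R_N^{(j)})^{-k}$, which is the scaling used in the bound above.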
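Then, the Four Moment Theorem 12 yields

\[ \big| E[G_j(\lambda_j(A_N))] - E[G_j(\lambda_j(A'_N))] \big| \le N^{-c_0} \tag{25} \]

for large enough $N$. But

\[ E[G_j(\lambda_j(A_N))] = E\Big[G_j(\lambda_j(A_N)) 1_{\frac{|\lambda_j(A_N) - N\gamma_j|}{R_N^{(j)}} \le 1}\Big] + E\Big[G_j(\lambda_j(A_N)) 1_{\frac{|\lambda_j(A_N) - N\gamma_j|}{R_N^{(j)}} > 1}\Big] \]
\[ = \frac{N^2}{10 (R_N^{(j)})^2} E\Big[(\lambda_j - \gamma_j)^2 1_{|\lambda_j - \gamma_j| \le \frac{R_N^{(j)}}{N}}\Big] + E\Big[G_j(\lambda_j(A_N)) 1_{\frac{|\lambda_j(A_N) - N\gamma_j|}{R_N^{(j)}} > 1}\Big]. \]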



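On the one hand,

\[ E\Big[G_j(\lambda_j(A_N)) 1_{\frac{|\lambda_j(A_N) - N\gamma_j|}{R_N^{(j)}} > 1}\Big] \le P\Big(|\lambda_j(W_N) - \gamma_j| > \frac{R_N^{(j)}}{N}\Big) \le \varepsilon_N. \]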



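On the other hand,

\[ E\Big[(\lambda_j - \gamma_j)^2 1_{|\lambda_j - \gamma_j| \le \frac{R_N^{(j)}}{N}}\Big] = E[(\lambda_j - \gamma_j)^2] - E\Big[(\lambda_j - \gamma_j)^2 1_{|\lambda_j - \gamma_j| > \frac{R_N^{(j)}}{N}}\Big]. \]

From the Cauchy-Schwarz inequality and ([[3

\[ E\Big[(\lambda_j - \gamma_j)^2 1_{|\lambda_j - \gamma_j| > \frac{R_N^{(j)}}{N}}\Big] \le \sqrt{E[(\lambda_j - \gamma_j)^4] \, P\Big(|\lambda_j - \gamma_j| > \frac{R_N^{(j)}}{N}\Big)} \le A N^{1/2} \varepsilon_N^{1/2}, \]

where $A > 0$ is a numerical constant. Then

\[ E[G_j(\lambda_j(A_N))] = \frac{N^2}{10 (R_N^{(j)})^2} \Big( E[(\lambda_j - \gamma_j)^2] + O(N^{1/2} \varepsilon_N^{1/2}) \Big) + O(\varepsilon_N) \]
\[ = \frac{N^2}{10 (R_N^{(j)})^2} E[(\lambda_j - \gamma_j)^2] + O\big(N^{5/2} \varepsilon_N^{1/2} (R_N^{(j)})^{-2}\big) + O(\varepsilon_N). \]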



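Repeating the same computations gives similarly

\[ E[G_j(\lambda_j(A'_N))] = \frac{N^2}{10 (R_N^{(j)})^2} E[(\lambda'_j - \gamma_j)^2] + O\big(N^{5/2} \varepsilon_N^{1/2} (R_N^{(j)})^{-2}\big) + O(\varepsilon_N). \]

Then (25) leads to

\[ E[(\lambda_j - \gamma_j)^2] = E[(\lambda'_j - \gamma_j)^2] + O(N^{1/2} \varepsilon_N^{1/2}) + O\big((R_N^{(j)})^2 N^{-2} \varepsilon_N\big) + O\big((R_N^{(j)})^2 N^{-c_0-2}\big). \]

As the first two error terms are smaller than the third one, the preceding equation
becomes

\[ E[(\lambda_j - \gamma_j)^2] = E[(\lambda'_j - \gamma_j)^2] + O\big((R_N^{(j)})^2 N^{-c_0-2}\big). \tag{26} \]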

2.3 Combining the results 

We distinguish between the bulk, the edge and the intermediate cases. Note that
the constants $C(\eta)$ and $C$ depend on the constants $B_1$ and $B_2$ in condition (C0).



2.3.1 Eigenvalues in the bulk of the spectrum 

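Let $0 < \eta \le \frac{1}{2}$ and $\eta N \le j \le (1-\eta)N$. From Theorem El, $E[(\lambda'_j - \gamma_j)^2] \le C(\eta)\frac{\log N}{N^2}$.
Thus, from (26), it remains to show that the error term is smaller than $\frac{\log N}{N^2}$. But

\[ R_N^{(j)} = (\log N)^{c\log\log N} N^{1/3} \min(j, N+1-j)^{-1/3} \le \eta^{-1/3} (\log N)^{c\log\log N}. \]

Then $(R_N^{(j)})^2 N^{-c_0-2} = o\big(\frac{\log N}{N^2}\big)$. As a consequence,

\[ E[(\lambda_j - \gamma_j)^2] = E[(\lambda'_j - \gamma_j)^2] + o\Big(\frac{\log N}{N^2}\Big) \]

and we get the desired result

\[ E[(\lambda_j - \gamma_j)^2] \le C(\eta)\frac{\log N}{N^2}. \]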
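To get a feeling for how slowly the $o(\cdot)$ statement for the error term sets in, the following Python sketch evaluates the logarithm of the ratio between the bound $\eta^{-2/3}(\log N)^{2c\log\log N} N^{-c_0-2}$ on the error term and the target rate $\log N / N^2$, working directly with $\log N$ to avoid overflow. The values of `eta`, `c` and `c0` used in the usage note are arbitrary assumptions (the text does not specify these constants), and the function name is hypothetical.

```python
import math


def log_error_ratio(log_n: float, eta: float, c: float, c0: float) -> float:
    """Return log( eta^(-2/3) (log N)^(2c loglog N) N^(-c0-2) / (log N / N^2) ).

    The argument is log N itself, so astronomically large N can be explored.
    A negative value means the error bound is below the target rate.
    """
    log_log_n = math.log(log_n)
    # log of (R_N^(j))^2, using R_N^(j) <= eta^(-1/3) (log N)^(c loglog N)
    log_r_squared = 2.0 * (-math.log(eta) / 3.0 + c * log_log_n ** 2)
    # N^(-c0-2) divided by N^(-2) leaves N^(-c0); dividing by log N subtracts loglog N
    return log_r_squared - c0 * log_n - log_log_n
```

With the assumed values `eta = 0.25`, `c = 1.0`, `c0 = 0.1`, the returned value is still positive at `log_n = 100` and becomes negative by `log_n = 1000`, which illustrates that "for $N$ large enough" may refer to very large matrix sizes, depending on the constants.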



2.3.2 Eigenvalues at the edge of the spectrum 

From Corollary El, $E[(\lambda'_N - \gamma_N)^2] = E[(\lambda'_N - 2)^2] \le C N^{-4/3}$. By means of (26), it
remains to prove that the error term is smaller than $N^{-4/3}$. We have

\[ R_N^{(N)} = (\log N)^{c\log\log N} N^{1/3}. \]

Consequently $(R_N^{(N)})^2 N^{-c_0-2} = o(N^{-4/3})$. Then

\[ E[(\lambda_N - 2)^2] = E[(\lambda'_N - 2)^2] + o(N^{-4/3}) \]

and

\[ E[(\lambda_N - 2)^2] \le C N^{-4/3}. \]

As for Gaussian matrices, the same result is available for the smallest eigenvalue $\lambda_1$.

2.3.3 Eigenvalues between the bulk and the edge of the spectrum 

Let $0 < \eta \le \frac{1}{2}$, $K > 20\sqrt{2}$ and $(1-\eta)N \le j \le N - K\log N$. From Theorem El,
$E[(\lambda'_j - \gamma_j)^2] \le C(\eta, K)\frac{\log(N-j)}{N^{4/3}(N-j)^{2/3}}$. Thus, from (26), it remains to show that the
error term is smaller than $\frac{\log(N-j)}{N^{4/3}(N-j)^{2/3}}$. But

\[ R_N^{(j)} = (\log N)^{c\log\log N} N^{1/3} (N+1-j)^{-1/3}. \]

Then $(R_N^{(j)})^2 N^{-c_0-2} = o\Big(\frac{\log(N-j)}{N^{4/3}(N-j)^{2/3}}\Big)$. As a consequence

\[ E[(\lambda_j - \gamma_j)^2] = E[(\lambda'_j - \gamma_j)^2] + o\Big(\frac{\log(N-j)}{N^{4/3}(N-j)^{2/3}}\Big) \]

and we get the desired result

\[ E[(\lambda_j - \gamma_j)^2] \le C(\eta, K)\frac{\log(N-j)}{N^{4/3}(N-j)^{2/3}}. \]

A similar result holds for the left-side of the spectrum.

3 Real matrices 

The goal of this section is to prove Theorems 1, 2 and 3 for real Wigner matrices.
Tao and Vu's Four Moment Theorem (Theorem 12) as well as Erdős, Yau and Yin's
Localization Theorem (Theorem 11) still hold for real Wigner matrices. Section 2
is therefore valid for real matrices. The point is then to establish the results in the
GOE case.

As announced in Section 1.2, the variance of eigenvalues at the edge of the spectrum
is known to be bounded by $N^{-4/3}$ for GOE matrices (see [15]). The conclusion
for the smallest and largest eigenvalues is then established for large families of real
symmetric Wigner matrices:

\[ \mathrm{Var}(\lambda_N) \le \frac{C}{N^{4/3}} \quad \text{and} \quad \mathrm{Var}(\lambda_1) \le \frac{C}{N^{4/3}}. \]

For eigenvalues in the bulk of the spectrum, O'Rourke proved in [18] a Central
Limit Theorem which is very similar to the one established by Gustavsson in [H].
In particular, the normalisation is still of the order of $\big(\frac{\log N}{N^2}\big)^{1/2}$ and differs from the
complex case only by a constant. It is therefore natural to expect the same bound on
the variance for GOE matrices. The situation is completely similar for intermediate
eigenvalues. But GOE matrices do not have the same determinantal properties as
GUE matrices, and it is therefore not clear that a deviation inequality (similar to
(9)) holds for the eigenvalue counting function. However, as explained by O'Rourke
in [18], GOE and GUE matrices are linked by interlacing formulas established by
Forrester and Rains (see [2j). These formulas lead to the following relation between
the eigenvalue counting functions in the complex and in the real cases: for all $t \in \mathbb{R}$,

\[ \mathcal{N}_t(W_N^{\mathbb{C}}) = \frac{1}{2}\big[\mathcal{N}_t(W_N^{\mathbb{R}}) + \mathcal{N}_t(\widetilde{W}_N^{\mathbb{R}})\big] + \zeta_N(t), \tag{27} \]

where $W_N^{\mathbb{C}} = \frac{1}{\sqrt{N}} M_N^{\mathbb{C}}$ and $M_N^{\mathbb{C}}$ is from the GUE, $W_N^{\mathbb{R}} = \frac{1}{\sqrt{N}} M_N^{\mathbb{R}}$, $\widetilde{W}_N^{\mathbb{R}} = \frac{1}{\sqrt{N}} \widetilde{M}_N^{\mathbb{R}}$,
$M_N^{\mathbb{R}}$ and $\widetilde{M}_N^{\mathbb{R}}$ are independent matrices from the GOE, and $\zeta_N(t)$ takes values in
$\{-1, -\frac{1}{2}, 0, \frac{1}{2}, 1\}$. See [18] for more details.

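The aim is now to establish a deviation inequality for the eigenvalue counting
function similar to (9). From (9), we know that for all $u \ge 0$,

\[ P\big(|\mathcal{N}_t(W_N^{\mathbb{C}}) - N\rho_t| \ge u + C_1\big) \le 2\exp\Big(-\frac{u^2}{2\sigma_t^2 + u}\Big). \]

Set $C'_1 = C_1 + 1$ and let $u \ge 0$. Since $W_N^{\mathbb{R}}$ and $\widetilde{W}_N^{\mathbb{R}}$ are independent with the same
distribution, and since $\zeta_N(t) \ge -1$ in (27), we can write

\[ P\big(\mathcal{N}_t(W_N^{\mathbb{R}}) - N\rho_t \ge u + C'_1\big)^2 = P\big(\mathcal{N}_t(W_N^{\mathbb{R}}) - N\rho_t \ge u + C'_1, \ \mathcal{N}_t(\widetilde{W}_N^{\mathbb{R}}) - N\rho_t \ge u + C'_1\big) \]
\[ \le P\Big(\frac{1}{2}\big(\mathcal{N}_t(W_N^{\mathbb{R}}) + \mathcal{N}_t(\widetilde{W}_N^{\mathbb{R}})\big) - N\rho_t \ge u + C'_1\Big) \]
\[ \le P\big(\mathcal{N}_t(W_N^{\mathbb{C}}) - N\rho_t \ge u + C'_1 - 1\big) \]
\[ \le 2\exp\Big(-\frac{u^2}{2\sigma_t^2 + u}\Big). \]

Repeating the computations for $P\big(\mathcal{N}_t(W_N^{\mathbb{R}}) - N\rho_t \le -u - C'_1\big)$ and combining with
the preceding yield

\[ P\big(|\mathcal{N}_t(W_N^{\mathbb{R}}) - N\rho_t| \ge u + C'_1\big) \le 2\sqrt{2}\exp\Big(-\frac{u^2}{4\sigma_t^2 + 2u}\Big). \tag{28} \]

Note that $\sigma_t^2$ is still the variance of $\mathcal{N}_t(W_N^{\mathbb{C}})$ in the preceding formula.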

What remains then to be proved is very similar to the complex case. From (28)
and Gustavsson's bounds on the variance $\sigma_t^2$ (see (10) for the bulk case and (19)
for the intermediate case), deviation inequalities for individual eigenvalues can be
deduced, as was done to prove Propositions [U] and HUJ. It is then straightforward to
derive the announced bounds on the variances for GOE matrices. The argument
developed in Section 2 in order to extend the GUE results to large families of Hermitian
Wigner matrices can be reproduced to reach the desired bounds on the variances
of eigenvalues in the bulk and between the bulk and the edge of the spectrum for
families of real Wigner matrices. Then there exists a constant $C(\eta) > 0$ such that
for all $\eta N \le j \le (1-\eta)N$,

\[ \mathrm{Var}(\lambda_j) \le E[(\lambda_j - \gamma_j)^2] \le C(\eta)\frac{\log N}{N^2}, \]

and there exists a constant $C(\eta, K) > 0$ such that for all $(1-\eta)N \le j \le N - K\log N$
(and similarly for the left-side of the spectrum),

\[ \mathrm{Var}(\lambda_j) \le E[(\lambda_j - \gamma_j)^2] \le C(\eta, K)\frac{\log(N-j)}{N^{4/3}(N-j)^{2/3}}. \]

4 A corollary on the 2-Wasserstein distance 

The bounds on the variances, more exactly on $E[(\lambda_j - \gamma_j)^2]$, developed in the
preceding sections lead to a bound on the rate of convergence of the empirical spectral
measure $L_N$ towards the semicircle law $\rho_{sc}$ in terms of 2-Wasserstein distance. Recall
that $W_2(L_N, \rho_{sc})$ is a random variable defined by

\[ W_2(L_N, \rho_{sc}) = \inf \Big( \int_{\mathbb{R}^2} |x - y|^2 \, d\pi(x, y) \Big)^{1/2}, \]

where the infimum is taken over all probability measures $\pi$ on $\mathbb{R}^2$ with respective
marginals $L_N$ and $\rho_{sc}$. To achieve the expected bound, we rely on another expression
of $W_2$ in terms of distribution functions, namely

\[ W_2^2(L_N, \rho_{sc}) = \int_0^1 \big(F_N^{-1}(x) - G^{-1}(x)\big)^2 \, dx, \tag{29} \]

where $F_N^{-1}$ (respectively $G^{-1}$) is the generalized inverse of the distribution function
$F_N$ (respectively $G$) of $L_N$ (respectively $\rho_{sc}$) (see for example [31]). On the basis of
this representation, the following statement may be derived.

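Proposition 13. There exists a universal constant $C > 0$ such that for all $N \ge 1$,

\[ W_2^2(L_N, \rho_{sc}) \le \frac{2}{N} \sum_{j=1}^N (\lambda_j - \gamma_j)^2 + \frac{C}{N^2}. \tag{30} \]

Proof. From (29),

\[ W_2^2(L_N, \rho_{sc}) = \int_0^1 \big(F_N^{-1}(x) - G^{-1}(x)\big)^2 \, dx. \]

Then,

\[ W_2^2(L_N, \rho_{sc}) = \sum_{j=1}^N \int_{\frac{j-1}{N}}^{\frac{j}{N}} \big(\lambda_j - G^{-1}(x)\big)^2 \, dx \]
\[ \le \frac{2}{N} \sum_{j=1}^N (\lambda_j - \gamma_j)^2 + 2 \sum_{j=1}^N \int_{\frac{j-1}{N}}^{\frac{j}{N}} \big(\gamma_j - G^{-1}(x)\big)^2 \, dx. \]

But $\gamma_j = G^{-1}(\frac{j}{N})$ and $G^{-1}$ is non-decreasing. Therefore, $|\gamma_j - G^{-1}(x)| \le \gamma_j - \gamma_{j-1}$
for all $x \in [\frac{j-1}{N}, \frac{j}{N}]$. Consequently,

\[ W_2^2(L_N, \rho_{sc}) \le \frac{2}{N} \sum_{j=1}^N (\lambda_j - \gamma_j)^2 + \frac{2}{N} \sum_{j=1}^N (\gamma_j - \gamma_{j-1})^2. \tag{31} \]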
j=i i=i 
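But if $j - 1 \ge \frac{N}{2}$ (and therefore $\gamma_{j-1} \ge 0$),

\[ \frac{1}{N} = \int_{\gamma_{j-1}}^{\gamma_j} \frac{1}{2\pi}\sqrt{4 - x^2} \, dx \ge \frac{\sqrt{2}}{2\pi} \int_{\gamma_{j-1}}^{\gamma_j} \sqrt{2 - x} \, dx \]
\[ \ge \frac{\sqrt{2}}{3\pi} (2 - \gamma_{j-1})^{1/2} (\gamma_j - \gamma_{j-1}) \ge c' \Big(\frac{N - j + 1}{N}\Big)^{1/3} (\gamma_j - \gamma_{j-1}) \]

from (12), where $c' > 0$ is a universal constant. Then

\[ \gamma_j - \gamma_{j-1} \le \frac{C}{N^{2/3}(N - j + 1)^{1/3}}. \]

It may be shown that a similar bound holds if $j - 1 \le \frac{N}{2}$. As a summary, there
exists a universal constant $c > 0$ such that, for all $j \ge 2$,

\[ \gamma_j - \gamma_{j-1} \le \frac{c}{N^{2/3} \min(j, N+1-j)^{1/3}}. \tag{32} \]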
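The spacing bound for the semicircle quantiles can be checked numerically. The Python sketch below computes $\gamma_j = G^{-1}(j/N)$ by bisection on the explicit semicircle distribution function and rescales the gaps as in (32). It is an illustration only: the function names are hypothetical, and bisection is simply one convenient way to invert $G$.

```python
import math


def semicircle_cdf(x: float) -> float:
    """Distribution function G of the density sqrt(4 - x^2) / (2 pi) on [-2, 2]."""
    if x <= -2.0:
        return 0.0
    if x >= 2.0:
        return 1.0
    return 0.5 + x * math.sqrt(4.0 - x * x) / (4.0 * math.pi) + math.asin(x / 2.0) / math.pi


def semicircle_quantile(p: float, iterations: int = 80) -> float:
    """Generalized inverse G^{-1}(p), computed by bisection on [-2, 2]."""
    lo, hi = -2.0, 2.0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if semicircle_cdf(mid) < p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def scaled_gaps(n: int) -> list:
    """N^(2/3) min(j, N+1-j)^(1/3) (gamma_j - gamma_{j-1}) for j = 2, ..., N."""
    gam = [semicircle_quantile(j / n) for j in range(n + 1)]
    return [
        (gam[j] - gam[j - 1]) * n ** (2.0 / 3.0) * min(j, n + 1 - j) ** (1.0 / 3.0)
        for j in range(2, n + 1)
    ]


def sum_squared_gaps(n: int) -> float:
    """Sum over j = 1, ..., N of (gamma_j - gamma_{j-1})^2, with gamma_0 = -2."""
    gam = [semicircle_quantile(j / n) for j in range(n + 1)]
    return sum((gam[j] - gam[j - 1]) ** 2 for j in range(1, n + 1))
```

For instance, `max(scaled_gaps(200))` gives an empirical value for the constant in (32) at $N = 200$, and `200 * sum_squared_gaps(200)` stays bounded. The latter is consistent with the Cauchy-Schwarz estimate $(\gamma_j - \gamma_{j-1})^2 \le \frac{1}{N}\int_{\gamma_{j-1}}^{\gamma_j} \frac{dx}{\rho_{sc}(x)}$, which sums to $2\pi^2/N$.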



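This yields

\[ \sum_{j=1}^N (\gamma_j - \gamma_{j-1})^2 \le \frac{c^2}{N^{4/3}} \sum_{j=1}^N \frac{1}{\min(j, N+1-j)^{2/3}} \le \frac{C}{N}, \]

where $C > 0$ is a universal constant. Then (31) becomes

\[ W_2^2(L_N, \rho_{sc}) \le \frac{2}{N} \sum_{j=1}^N (\lambda_j - \gamma_j)^2 + \frac{C}{N^2}, \]

where $C > 0$ is a universal constant, which is the claim. □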
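As a sanity check of the argument, the quantile representation (29) and the intermediate bound (31) can be evaluated for any finite list of points. The Python sketch below does this with a midpoint rule, after the change of variable $x = G(y)$ on each interval $[\frac{j-1}{N}, \frac{j}{N}]$. All function names are hypothetical, and the quadrature scheme and the test points are arbitrary choices made for illustration.

```python
import math


def semicircle_density(x: float) -> float:
    """Semicircle density sqrt(4 - x^2) / (2 pi), zero outside [-2, 2]."""
    return math.sqrt(max(4.0 - x * x, 0.0)) / (2.0 * math.pi)


def semicircle_cdf(x: float) -> float:
    """Distribution function G of the semicircle law."""
    if x <= -2.0:
        return 0.0
    if x >= 2.0:
        return 1.0
    return 0.5 + x * math.sqrt(4.0 - x * x) / (4.0 * math.pi) + math.asin(x / 2.0) / math.pi


def semicircle_quantile(p: float, iterations: int = 80) -> float:
    """Generalized inverse G^{-1}(p), computed by bisection."""
    lo, hi = -2.0, 2.0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if semicircle_cdf(mid) < p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def w2_squared(points, steps: int = 200) -> float:
    """Approximate W_2^2 between the empirical measure of `points` and the semicircle law.

    Uses (29) in the form: sum over j of the integral of (lambda_j - y)^2 rho_sc(y)
    over [gamma_{j-1}, gamma_j], evaluated with a midpoint rule.
    """
    lam = sorted(points)
    n = len(lam)
    gam = [semicircle_quantile(j / n) for j in range(n + 1)]
    total = 0.0
    for j in range(1, n + 1):
        h = (gam[j] - gam[j - 1]) / steps
        for k in range(steps):
            y = gam[j - 1] + (k + 0.5) * h
            total += (lam[j - 1] - y) ** 2 * semicircle_density(y) * h
    return total


def bound_31(points) -> float:
    """Right-hand side of (31): (2/N) sum (lambda_j - gamma_j)^2 + (2/N) sum (gamma_j - gamma_{j-1})^2."""
    lam = sorted(points)
    n = len(lam)
    gam = [semicircle_quantile(j / n) for j in range(n + 1)]
    deviations = sum((lam[j - 1] - gam[j]) ** 2 for j in range(1, n + 1))
    gaps = sum((gam[j] - gam[j - 1]) ** 2 for j in range(1, n + 1))
    return 2.0 * (deviations + gaps) / n
```

With a single point at the origin, `w2_squared([0.0])` approximates the variance of the semicircle law, which equals 1. For any list of points, `w2_squared(points) <= bound_31(points)` is expected to hold up to quadrature error.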

Proof of the Corollary. Let $N \ge 2$. Due to Proposition 13,

\[ E[W_2^2(L_N, \rho_{sc})] \le \frac{2}{N} \sum_{j=1}^N E[(\lambda_j - \gamma_j)^2] + \frac{C}{N^2}. \]

We then make use of the bounds on $E[(\lambda_j - \gamma_j)^2]$ produced in the previous sections.
Set $\eta \in (0, \frac{1}{2}]$ and $K > 20\sqrt{2}$ so that $K\log N \le \eta N$. We first decompose

\[ \sum_{j=1}^N E[(\lambda_j - \gamma_j)^2] = \sum_{j=1}^{K\log N} E[(\lambda_j - \gamma_j)^2] + \sum_{j=K\log N+1}^{\eta N} E[(\lambda_j - \gamma_j)^2] \]
\[ + \sum_{j=\eta N+1}^{(1-\eta)N-1} E[(\lambda_j - \gamma_j)^2] + \sum_{j=(1-\eta)N}^{N-K\log N-1} E[(\lambda_j - \gamma_j)^2] \]
\[ + \sum_{j=N-K\log N}^{N} E[(\lambda_j - \gamma_j)^2] \]
\[ = \Sigma_1 + \Sigma_2 + \Sigma_3 + \Sigma_4 + \Sigma_5. \]

The sum $\Sigma_3$ will be bounded using the bulk case (Theorem 1), while Theorem 2
will be used to handle $\Sigma_2$ and $\Sigma_4$. A crude version of Theorem 11 will be enough to
bound $\Sigma_1$ and $\Sigma_5$. To start with, from Theorem 1,

\[ \Sigma_3 \le \sum_{j=\eta N+1}^{(1-\eta)N-1} C(\eta)\frac{\log N}{N^2} \le C(\eta)\frac{\log N}{N}. \]

Secondly, from Theorem 3,

\[ \Sigma_4 \le \frac{C(\eta, K)}{N^{4/3}} \sum_{j=K\log N+1}^{\eta N} \frac{\log j}{j^{2/3}} \le C(\eta, K)\frac{\log N}{N}. \]
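The last inequality rests on the elementary estimate $\sum_{k \le M} \frac{\log k}{k^{2/3}} \le 3 M^{1/3}\log M$ for large $M$, applied with $M = \eta N$. The following Python sketch evaluates the normalized sum directly and compares it with $\log N / N$. The values of `eta` and `k_const` in the usage note are assumptions chosen for illustration (with `k_const` above $20\sqrt{2}$), and the function names are hypothetical.

```python
import math


def intermediate_sum(n: int, eta: float, k_const: float) -> float:
    """N^(-4/3) times the sum of log(k) / k^(2/3) over K log N + 1 <= k <= eta N."""
    first = int(k_const * math.log(n)) + 1
    last = int(eta * n)
    total = sum(math.log(k) / k ** (2.0 / 3.0) for k in range(first, last + 1))
    return total / n ** (4.0 / 3.0)


def ratio_to_rate(n: int, eta: float, k_const: float) -> float:
    """Ratio of the normalized sum to the target rate log(N) / N."""
    return intermediate_sum(n, eta, k_const) / (math.log(n) / n)
```

For example, `ratio_to_rate(10**4, 0.5, 30.0)` and `ratio_to_rate(10**5, 0.5, 30.0)` both remain below $3\eta^{1/3}$, in line with the stated order $\frac{\log N}{N}$.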



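Next $\Sigma_1$ and $\Sigma_5$ have only $K\log N$ terms. If each term is bounded by $\frac{C}{N}$ where $C$
is a positive universal constant, we get that $\Sigma_1 + \Sigma_5 \le 2KC\frac{\log N}{N}$, which is enough to
prove the desired result on $\sum_{j=1}^N E[(\lambda_j - \gamma_j)^2]$. For $N$ large enough depending only
on the constant $C$ in Theorem 11, $\frac{1}{\sqrt{N}} \ge \frac{(\log N)^{c\log\log N}}{N^{2/3}\min(j, N+1-j)^{1/3}}$ and Theorem 11 yields

\[ P\Big(|\lambda_j - \gamma_j| > \frac{1}{\sqrt{N}}\Big) \le C e^{-(\log N)^{c\log\log N}}. \]

Then, by the Cauchy-Schwarz inequality,

\[ E[(\lambda_j - \gamma_j)^2] = E\Big[(\lambda_j - \gamma_j)^2 1_{|\lambda_j - \gamma_j| \le \frac{1}{\sqrt{N}}}\Big] + E\Big[(\lambda_j - \gamma_j)^2 1_{|\lambda_j - \gamma_j| > \frac{1}{\sqrt{N}}}\Big] \]
\[ \le \frac{1}{N} + \sqrt{E[(\lambda_j - \gamma_j)^4] \, P\Big(|\lambda_j - \gamma_j| > \frac{1}{\sqrt{N}}\Big)}. \]

As the last term is $O\big(N^{1/2} e^{-\frac{1}{2}(\log N)^{c\log\log N}}\big) = o\big(\frac{1}{N}\big)$, there exists a constant $C > 0$ such that
$E[(\lambda_j - \gamma_j)^2] \le \frac{C}{N}$. Then

\[ \Sigma_1 + \Sigma_5 \le 2KC\frac{\log N}{N}. \]

As a consequence,

\[ \sum_{j=1}^N E[(\lambda_j - \gamma_j)^2] \le C\frac{\log N}{N}. \]

Therefore

\[ E[W_2(L_N, \rho_{sc})^2] \le C\frac{\log N}{N^2}, \]

where $C > 0$ is a universal constant, which is the claim. The corollary is thus
established. □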

Since $W_1 \le W_2$, the preceding corollary implies that

\[ E[W_1(L_N, \rho_{sc})] \le C\frac{\sqrt{\log N}}{N}. \]

This is an improvement of Meckes and Meckes' rate of convergence obtained in
[16]. Note however that the distance studied in [16] is the expected 1-Wasserstein
distance between $L_N$ and its mean instead of $\rho_{sc}$. The rate of convergence in
1-Wasserstein distance can furthermore be compared to the rate of convergence in
Kolmogorov distance. Indeed, if $\mu$ and $\nu$ are two probability measures on $\mathbb{R}$ such
that $\nu$ has a bounded density function with respect to Lebesgue measure, $d_K(\mu, \nu) \le c\sqrt{W_1(\mu, \nu)}$,
where $c > 0$ depends on the bound on the density function. This is
due to the fact that

\[ W_1(\mu, \nu) = \sup_{f \ 1\text{-Lipschitz}} \Big( \int_{\mathbb{R}} f \, d\mu - \int_{\mathbb{R}} f \, d\nu \Big) \]

(see for example [21]) and $d_K(\mu, \nu) = \sup_{x \in \mathbb{R}} |\mu((-\infty, x]) - \nu((-\infty, x])|$. Approximating
$1_{(-\infty, x]}$ from above and from below by $\frac{1}{\varepsilon}$-Lipschitz functions and optimizing
on $\varepsilon$ gives the result. Therefore, the preceding implies that

\[ E[d_K(L_N, \rho_{sc})] \le c\frac{(\log N)^{1/4}}{N^{1/2}}, \]

which is however far from Gotze and Tikhomirov's recent bound pH] 

\[ E[d_K(L_N, \rho_{sc})] \le \frac{(\log N)^c}{N}. \]



5 Eigenvalue variance bounds for covariance matrices

This section provides the analogous non-asymptotic bounds on the variance of
eigenvalues for covariance matrices. The proofs will be detailed in a separate write-up.
Therefore, this section contains only the background and the results.

Random covariance matrices are defined as follows. Let $X$ be an $m \times n$ (real
or complex) matrix, with $m \ge n$, such that its entries are independent, centered and
have variance 1. Then $S_{m,n} = \frac{1}{n}X^*X$ is a random covariance matrix. An important
example is the case when the entries of $X$ are Gaussian. Then $S_{m,n}$ belongs to the
so-called Laguerre Unitary Ensemble (LUE) if the entries of $X$ are complex and to
the Laguerre Orthogonal Ensemble (LOE) if they are real. $S_{m,n}$ is Hermitian (or
real symmetric) and therefore has $n$ real eigenvalues. As $m \ge n$, none of these
eigenvalues is trivial. Furthermore, these eigenvalues are nonnegative and will be
denoted by $0 \le \lambda_1 \le \cdots \le \lambda_n$.

Similarly to Wigner's Theorem, the classical Marchenko-Pastur theorem states
that, if $\frac{m}{n} \to \rho \ge 1$ when $n$ goes to infinity, the empirical spectral measure $L_{m,n} = \frac{1}{n}\sum_{j=1}^n \delta_{\lambda_j}$
converges almost surely to a deterministic measure, called the Marchenko-Pastur
distribution of parameter $\rho$. This measure is compactly supported and is
absolutely continuous with respect to Lebesgue measure, with density

\[ d\mu_{MP(\rho)}(x) = \frac{1}{2\pi x}\sqrt{(b_\rho - x)(x - a_\rho)} \, 1_{[a_\rho, b_\rho]}(x) \, dx, \]

where $a_\rho = (1 - \sqrt{\rho})^2$ and $b_\rho = (1 + \sqrt{\rho})^2$ (see for example [3]). Two different
behaviors arise according to the value of $\rho$. If $\rho > 1$, $a_\rho > 0$ is a soft edge, which
means that an eigenvalue $\lambda_j$ can be larger or smaller than $a_\rho$. On the contrary, if $\rho = 1$,
$a_\rho = 0$ is called a hard edge: no eigenvalue can be less than $a_\rho$. Furthermore, the
Marchenko-Pastur density function explodes at 0. It is the case in particular when
$m = n$. We will denote by $\mu_{m,n}$ the approximate Marchenko-Pastur distribution
whose density is defined by

\[ \mu_{m,n}(x) = \frac{1}{2\pi x}\sqrt{(x - a_{m,n})(b_{m,n} - x)} \, 1_{[a_{m,n}, b_{m,n}]}(x), \]

with $a_{m,n} = \big(1 - \sqrt{\frac{m}{n}}\big)^2$ and $b_{m,n} = \big(1 + \sqrt{\frac{m}{n}}\big)^2$.
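The Marchenko-Pastur density and its two edge regimes are easy to explore numerically. The Python sketch below evaluates the density $\frac{1}{2\pi x}\sqrt{(b_\rho - x)(x - a_\rho)}$ on $[a_\rho, b_\rho]$ and integrates it with a midpoint rule. The function names and the quadrature are illustrative assumptions, and the normalization is the one with $S_{m,n} = \frac{1}{n}X^*X$ and $\frac{m}{n} \to \rho \ge 1$.

```python
import math


def mp_edges(rho: float):
    """Edges a_rho = (1 - sqrt(rho))^2 and b_rho = (1 + sqrt(rho))^2 of the support."""
    root = math.sqrt(rho)
    return (1.0 - root) ** 2, (1.0 + root) ** 2


def mp_density(x: float, rho: float) -> float:
    """Marchenko-Pastur density of parameter rho >= 1 (zero outside the open support)."""
    a, b = mp_edges(rho)
    if x <= a or x >= b:
        return 0.0
    return math.sqrt((b - x) * (x - a)) / (2.0 * math.pi * x)


def integrate(f, lo: float, hi: float, steps: int = 20000) -> float:
    """Plain midpoint rule, adequate for the square-root behavior at soft edges."""
    h = (hi - lo) / steps
    return sum(f(lo + (k + 0.5) * h) for k in range(steps)) * h
```

For `rho = 2.0`, integrating `mp_density` over `mp_edges(2.0)` gives a total mass close to 1, and the first moment is close to `rho`. For `rho = 1.0` the left edge is 0 and `mp_density(x, 1.0)` grows like $x^{-1/2}$ as $x \to 0$, which is the hard-edge blow-up described in the text.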

The asymptotic behaviors of individual eigenvalues for LUE matrices have been
known for some time and extended to more general covariance matrices in the last
decade. For an eigenvalue in the bulk of the spectrum, i.e. $\lambda_j$ such that $\eta n \le j \le (1-\eta)n$,
a Central Limit Theorem was proved by Su (see [22]) and Tao-Vu (see
[23]). From this theorem, the variance of such eigenvalues is guessed to be of the
order of $\frac{\log n}{n^2}$. For right-side intermediate eigenvalues, i.e. $\lambda_j$ such that $\frac{j}{n} \to 1$ and
$n - j \to \infty$, Su, and later Tao-Vu and Wang, proved a CLT (see [22], [23] and
[32]). The variance appears to be of the order of $\frac{\log(n-j)}{n^{4/3}(n-j)^{2/3}}$. Similar results probably hold
for the left-side of the spectrum, when $\rho > 1$. Finally, for the smallest and the
largest eigenvalues $\lambda_1$ and $\lambda_n$, CLTs were proved for Gaussian matrices by Borodin-Forrester
(see [6]). Several authors extended these results to more general covariance
matrices. The latest results are due to Tao-Vu and Wang (see [23] and [32]). Their
variances are then guessed to be of the order of $n^{-4/3}$. It should be mentioned
that the result for the smallest eigenvalue only holds when $\rho > 1$. When $\rho = 1$, in
the case of a square matrix, Edelman proved a CLT for the smallest eigenvalue $\lambda_1$,
extended by Tao and Vu in [25], from which the variance is guessed to be of the
order of $n^{-4}$.

The following statement summarizes a number of quantitative bounds on the
eigenvalues of covariance matrices which are proved by methods similar to the ones
developed in the preceding sections. For simplicity, we basically assume that $\rho > 1$.
More precisely, we assume that $1 < A_1 \le \frac{m}{n} \le A_2$ (where $A_1$ and $A_2$ are fixed
constants) and that $S_{m,n}$ is a covariance matrix whose entries have an exponential
decay (condition (C0)) and have the same first four moments as those of a LUE
matrix.

Theorem 14. 1. In the bulk of the spectrum.

Let $\eta \in (0, \frac{1}{2}]$. There exists a constant $C > 0$ (depending on $\eta$, $A_1$ and $A_2$) such
that for every covariance matrix $S_{m,n}$, for all $\eta n \le j \le (1-\eta)n$,

\[ \mathrm{Var}(\lambda_j) \le C\frac{\log n}{n^2}. \]

2. Between the bulk and the edge of the spectrum.

There exists a constant $\kappa > 0$ (depending on $A_1$ and $A_2$) such that the following
holds. For all $K > \kappa$, for all $\eta \in (0, \frac{1}{2}]$, there exists a constant $C > 0$ (depending
on $K$, $\eta$, $A_1$ and $A_2$) such that for every covariance matrix $S_{m,n}$, for all
$(1-\eta)n \le j \le n - K\log n$,

\[ \mathrm{Var}(\lambda_j) \le C\frac{\log(n-j)}{n^{4/3}(n-j)^{2/3}}. \]

3. At the edge of the spectrum.

There exists a constant $C > 0$ (depending on $A_1$ and $A_2$) such that, for every
covariance matrix $S_{m,n}$,

\[ \mathrm{Var}(\lambda_n) \le C n^{-4/3}. \]



As for Wigner matrices, the first two results of this theorem are first proved 
in the Gaussian case, using the fact that the eigenvalues of a LUE matrix form 
a determinantal process. It is then possible to derive a deviation inequality for 
the eigenvalue counting function and then for individual eigenvalues. Integrating 
leads to the results for LUE matrices. The result for the largest eigenvalue in the 
Gaussian case is already known, see [15]. These bounds are then extended to
non-Gaussian matrices, relying on three recent papers. First, Pillai and Yin proved in
[20] localization properties for individual eigenvalues of covariance matrices, very
similar to the localization properties for Wigner matrices established by Erdős, Yau
and Yin in [8]. Secondly, Tao and Vu (see [23]) and later Wang (see [32]) proved a
Four Moment Theorem for these matrices. Combining these theorems as for Wigner
matrices yields the theorem.